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1 Introduction 



These lectures concern the development and uses of two properties, hyperbolicity and stability. 
Hyperbolicity has been used chiefly in geometric and analytic contexts, concerning wave-like par- 
tial differential equations, lacunas, and generalized Fourier transforms. Stability has its origins in 
control theory but has surfaced more recently in combinatorics and probability, where its algebraic 
properties, such as closure under various operations, are paramount. We begin with the definitions. 

Definition 1.1 (hyperbolicity). A homogeneous complex d-variable polynomial p(z±, Zd) is said 
to be hyperbolic in direction x £ W 1 if and only if 

p(x + iy) ^ for all y £ R d . (f.f) 

A non-homogeneous d-variable polynomial q with leading homogeneous part p is said to be hyperbolic 
if and only if p(x) ^ and there is some to > such that 

q{it* + y) ^ for all y £ R d and real t < t . (f .2) 

A polynomial is said to be hyperbolic if it is hyperbolic in direction £ for some £ £ M. d . 

Definition 1.2 (stability). The complex polynomial q is said to be stable if and only if 

> for all j = 1, . . . , d implies q(zi, . . . , Zd) ^ . (1-3) 

A real stable polynomial is one that is both real and stable. 

The following relation holds between these two definitions. 

Proposition 1.3. A real homogeneous polynomial p is stable if and only if it is hyperbolic in direction 
£ for every £ in the positive orthant. 

Hyperbolicity is a real geometric property, that is, it is invariant under the general linear group. 
It is not invariant under complex linear transformations, rather it distinguishes the real subspace 
of C d . Stability is invariant under yet fewer maps than is hyperbolicity, due to the fact that it 
distinguishes positive from negative. Stability is invariant under coordinatewise transformations of 
the upper half-plane, such as inversion z t— > —1/z and dilation z t— > Xz, from which one can get a 
surprising amount of mileage. 

The notion of hyperbolicity for polynomials has been around since its introduction by L. Garding 
sixty years ago |Gar51] H The notion of stability for real or complex polynomials in several variables 
goes back at least to multivariate versions of Hurwitz stability in the 1980's, but appears to have 
become established as tools for combinatorialists only in the last five years [COS WQ4l IBra071 IBBL09 , 
IGur08] . These two related notions have now been applied in several seemingly unrelated fields. 

3 Garding credits many of the ideas to I. G. Petrovsky in his seminal paper Tet45 . 
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Garding's original purpose was to find the right condition to guarantee analytic stability of a 
PDE as it evolves in time from an original condition in space; the notion of stability here is that 
small perturbations of the initial conditions should produce small perturbations of the solution at 
time t. That Garding found the right definition is undeniable because his result (Theorem l2.1l below') 
is an equivalence. His proof uses properties of hyperbolic functions and their cones that are derived 
directly from the definitions. Specfically, in order to construct solutions to boundary value problems 
via the Riesz kernel, an inverse Fourier transform is computed on a linear space {x + iy : y £ M. d }; 
hyperbolicity in the direction x is used to ensure that this space does not intersect the set {q = 0} 
where q~ a is singular. Garding developed some properties of hyperbolic polynomials in this paper 
and later devoted a separate paper to the further development of some inequalities |Gar59j . 

Twenty years later, with Atiyah and Bott [ABG70| , this work was extended considerably. They 
were able to compute inverse Fourier transforms more explicitly for a number of homogeneous hy- 
perbolic functions. To do this, they developed semi-continuity properties of hyperbolic functions. 
These properties enable the construction of certain vector fields and deformations, which in turn are 
used to deform the plane {x + iy : y £ M. d } into a cone on which the inverse Fourier transforms are 
directly integrable. Direct integrability provides useful estimates and also enables explicit computa- 
tion via topological dimension reduction theorems of Leray and Petrovsky. My introduction to this 
exceptional body of work came in the early 2000 's when I needed to apply these deformations to 
multivariate analytic combinatorics; refining the relatively crude techniques in [PW02, PWQ4] led 
to [BPTT] . 

In control theory, a continuous-time (respectively discrete-time) system whose transfer function 
is rational will be stable if its poles all lie in the open left half-plane (respectively the open unit disk) . 
Accordingly, we say a univariate polynomial is Hurwitz stable (respectively Schur stable) if its 
zeros are all in the open left half-plane (respectively the open unit disk) . Multivariate generalizations 
of Hurwitz and Schur stability have arisen in a variety of problems. The development of these 
generalizations splits into two veins. The review paper [Sok05 surveys a number of "hard" results 
on zero-free regions, meaning results that give good estimates on zero-free regions that depend on 
specific parameters of the models or graphs from which the polynomial is formed. Most relevant to 
the applications in Sections [6] and [7] are "soft" results, which give simple domains free of zeros (such 
as products of half planes), valid for all graphs or for wide classes of graphs. 

The development of this vein of multivariate stable function theory took place mainly in the area 
of statistical physics and combinatorial extensions thereof. Multivariate Schur stability arises in 
the celebrated Lee- Yang Theorem for ferromagnetic Ising models L YL52j ILY52] , where it implies the 
absence of phase transitions at nonzero magnetic field. Likewise, multivariate Hurwitz stability arises 
in electrical circuit theory |FB85[ ICOSW041 ISokllj and matroid generalizations thereof [CQSW041 
IBra07| IWW09| . and has applications to combinatorial enumeration |Wag06| [Wag 08|. It also arises 
in generalizations of the Lee- Yang Theorem |LS81j and in the Heilmann-Lieb Theorem for matching 
polynomials (also known as monomer- dimer models) |HL72| [COSW04]. It should be noted that 
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Hurwitz stability Hur96 , differs from the notion in Definition 11.21 in that the zero- free region is the 
right half-plane rather than the upper half-plane. For general complex polynomials the two notions 
are equivalent under a linear change of variables, but for real polynomials the two notions are very 
different. Except for brief portions of Sections 0] and [31 our concern will be with stability as defined 
in Definition 11.21 

Multivariate stability behaves nicely under certain transformations. The closure properties were 
investigated further by Borcea and Branden [BB09a . These closure properties turn out to have 
powerful implications for systems of negatively dependent random variables, as worked out in 2009 
by Borcea, Branden and Liggett BBL09 . My second brush with this subject was when I needed to 
apply some of their results to determinantal point processes |PP11| . 

These two encounters with hyperbolic/stable polynomials in seemingly dissimilar areas of mathe- 
matics prompted my search for a unified understanding. As pointed out at the end of the introduction 
of |BBL09 , such a viewpoint was taken 25 years ago by Gian-Carlo Rota. 

"The one contribution of mine that I hope will be remembered has consisted in just 
pointing out that all sorts of problems of combinatorics can be viewed as problems of 
the locations of zeros of certain polynomials..." 

to which we would add "and also problems in differential equations, number theory, probability, and 
perhaps many more areas." 

Presently, the greatest interest in the subject of hyperbolic/stable polynomials seems to be its 
potential for unlocking some combinatorial conjectues concerning determinants such as the Bessis- 
Moussa-Villani conjecture, Lieb's "permanent on top" conjecture and extensions of the van der 
Waerden conjecture, proved in 1979, but still an active subject (see, e.g., |Gur06j ). The distances 
between areas of mathematics in which hyperbolicity plays a role are such that it was difficult to 
find a primary subject classification for these lectures. It also poses a unique challenge for me, as I 
am an expert in none of these, save for applications to analytic combinatorics. I therefore apologize 
in advance for any historical inaccuracies perpetrated here, or for idiosyncratic viewpoints arising 
from my personal history with the subject. Here follows an attempt to lay out its two main pillars, 
hyperbolicity and stability, and to follow their progress up to the present day. 
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Part I: Hyperbolicity 



2 Origins, definitions and properties 

2.1 Relation to the propagation of wave-like equations 

Garding's objective was to prove stability results for wave-like partial differential equations. Endow 
the space of smooth complex valued functions of d real variables, denoted C°° (R d ) , with the topology 
of uniform convergence on compact sets of all the partial derivatives. For any polynomial q in d 
variables, let D[q] = q(d/dx) denote the linear differential operator obtained by formally substituting 
(d/dxi) for Xi in q. If we consider M. d as spacetime with the positive time direction given by a vector 
£, and denote by the hyperplane orthogonal to £, then the boundary value equation 

D[q](f) = 0, f = gonH 6 (2.1) 

on the halfspace {x : x • £ > 0} may be interpreted as the evolution of g under the equation 
D[q](f) — with initial condition g. One would expect m — 1 further initial conditions, such as 
(d 3 /dxQf = gj for 1 < j < m — 1, where (d/dx^) denotes a derivative in direction £ and m is the 
degree of q. 

We consider (|2.1[) to be stable under perturbations (in time direction £) if for all sequences 
{fn} of functions in C°°(M d ) satisfying D[q](f n ) = 0, convergence to zero of the restrictions of /„ to 

implies convergence to zero of /„ on the whole space. Intuitively, arbitrary small change in the 
initial conditions cannot lead to macroscopic changes in the evolution at a later time. The first of 
Garding's results can be stated as follows. 

Theorem 2.1 ( Gar51, Theorem III]). The differential equation (|2.1[) is stable in direction £ if and 
only if q is hyperbolic in direction £. 

It should be noted that the definition of hyperbolicity in direction x in Garding's 1951 paper 
is that q(tx + iy) ^ when y £ M. d and t > to. This does not appear to me to be equivalent to 
Definition 11.11 which appears in all later work including |ABG70] . The two definitions specialize 
to the same thing when q is homogeneous. Although the primary focus of these notes is on later 
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uses outside of PDE's, I will give indications of the proof. Not only does this satisfy curiosity and a 
historical sense, but it promotes the goal of using some of the geometric understanding when dealing 
with contemporary, more algebraic problems. 

To keep things simple, assume q is homogeneous. Examples will be discussed shortly, but for 
now, a good mental picture is to keep in mind the example d = 3 and q{x,y,z) = z 1 — x 2 — y 2 . 
Even though R d and its dual are isomorphic, I find it useful to classify a vector as belonging to the 
dual space (M. d )* rather than R d if it plays the role of a linear functional on the original space. For 
example the frequency £ of a wave x i— > exp(i£ • x) is thought of as living in (R d )*. 

Forward direction: For any £ € (R d )*, let denote the function x i-» e^' x mapping R d to 
C. Of course (M. d )* is isomorphic to M. d but we call it (R d )* to remind ourselves that £ runs over 
frequency space. When r is real, the function / r is a sinusoidal wave and is bounded, as in Figure [T] 
Let us assume without loss of generality that £ = (0, . . . , 0, 1). If q(£) = then /j is a solution to 



Figure 1: / r when r is real 

D[q](f) =0. If q is not hyperbolic in direction £ then there is some (r 1; . . . , r^-i) such that the 
d solutions to q(ri, . . . , r<2— i, t) = include at least one conjugate pair of values that are not real. 
Denoting by Td the one with the negative imaginary part, we see that / r (x) grows exponentially 
as Xd increases. By homogeneity, f\ T grows at A times the exponential rate. Choose c\ so that 
c\f\ r (0, . . . , 0, 1) = 1. Then f nr — > on the hyperplane in the topology of uniform convergence 
(due to periodicity, no compact set restriction is required), while f nr = 1 at (0, ... ,0, 1), proving 
the contrapositive, namely that lack of hyperbolicity implies lack of stability. □ 

Backward direction, handwaving argument: For fixed (r l7 . . . , r<j_i), the set {/ r : r e (M d )*} 
of solutions to D[q]f — spans a vector space V(r%, . . . r^-i) of dimension k. Since we are waving 
our hands, we have assumed strong hyperbolicity, namely that the d values of are distinct. 
Any integral / c(ri, . . . ,rd-i)dri ■ ■ ■ drd-i is also a solution, where c(-) is a section of V(-). The 
handwaving part is that this gives all solutions of D[q]f = 0. Assuming this, we write a generic 
solution / as such an integral. Each evolves unitarily, so more handwaving along the lines of a 
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Parseval relation shows that if / and its first k derivatives are small at time zero then the time t 
value is small as well. □ 



Removing all the handwaving takes considerable work, beyond our scope here. The key is the 
construction of the Riesz kernel, which is interesting enough to merit a brief digression. Suppose 
that q is TO-homogeneous and hyperbolic in direction x and let a is any complex number. Define 
the Riesz kernel Q a by 



We have not yet defined cones of hyperbolicity, but let us nonetheless try to visualize why the 
Riesz kernel is well defined and supported on the dual cone. To explain the terminology, the convex 
dual of a cone K C K d is the cone K* C (M d )* consisting of vectors y such that w • y > for 
all w 6 K. Duality maps fat cones to skinny cones and vice versa (see Figure [2]). The connected 



components of the open set {w (E M. d : q(w) 7^ 0} are cones. If convex they have duals. Let 
C be the component containing x. We will see later that hyperbolicity guarantees this is one of 
the convex ones. Hyperbolicity in direction x guarantees that q is nonvanishing on the domain of 
integration x + iM. . This is illustrated in Figure |31 where the dashed lines signify that the contour 
of integration is varying in imaginary directions only and therefore does not hit the surface {q = 0}. 
The magnitude of the exponential term is constant, whence the integral is convergent at infinity 




q(z) a exp(y ■ z) dz . 



(2.2) 




Figure 2: The cone K and its dual K* . 




Figure 3: Chain of integration in the definition of Q a 
when mRe{a} > 1 + d. Because the chain of integration avoids {q = 0}, integrability is assured 

G 



as is a well defined branch of q~ a . It follows easily from Cauchy's Theorem that the Riesz kernel is 
independent of the choice of x within the component C. Furthermore, if y G (W 1 )* is a vector for 
which some x g C has x • y < 0, then it is clear that the integral defining Q a (y) can be deformed 
so as to be arbitrarily small, hence is equal to zero. In other words, Q a is supported on the set of y 
for which there is no such x, which is precisely the dual cone C* . □ 

To complete the sketchy proof of the backward direction, convolving with the Riesz kernel gives 
an operator I q inverted by D[q\. This constuction is stable under perturbations. If D[q]f = 0, one 
shows that / is equal to (/ — I q D[q])f, where / is defined stably in terms of / and its first d — 1 
normal derivatives on H^. Small changes in the initial conditions thus give rise to small changes in 
/, which give rise to small changes in /, proving the theorem. 

The Riesz kernels with parameters a and a — 1 are related by a differential identity as long as 
a / 1. When a = 1, the Riesz kernel is called the fundamental solution by reason of a result 
of Garding. This is explained more fully in Theorem 12.81 below, but the short statement, found for 
instance in |Giil97[ Theorem 2.2], is that the solution of 

D[q]f(x) = £(x) 

exists, is unique, and is supported on the dual cone, C*. 



2.2 Homogeneous hyperbolic polynomials 

Many of the properties of hyperbolic polynomials are easier to state, prove and understand in the 
homogeneous case. Throughout this section, therefore, we deal only with homogeneous polynomi- 
als. We will use p rather than q as a visual cue when speaking about homogeneous polynomials. 
Hyperbolicity is easily seen to be equivalent to the equivalent to the following "real root" property. 

Proposition 2.2. The homogeneous polynomial p is hyperbolic in direction x if and only if for any 
y G M the univariate polynomial t >— > p(y + tx) has only real roots. 

Proof: Because p is homogeneous, when A ^ 0, we have p(Xz) = if and only if p(z) = 0. With 
A = is for some real nonzero value of s we see that p(x + iy) ^ for all y € K d (the definition of 
hyperbolicity) is equivalent to p{y + isx) ^ for all y € R d and all nonzero real s. This is equivalent 
to p(y + <x + isx) 7^ for all y S M d and real s, t with s ^ 0. Writing z = t + is, this is equivalent 
to p(y + zx) when z is not real. □ 

Denote by tfc(x, y) the roots of t p(y + tx). Then 

m 

p{y + tx)=p{x)l[[t-t k {x,y)}. (2.3) 

k=l 
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By Proposition 12.21 when p is hyperbolic in direction £, all values tfc(x, y) are real, hence, setting 
t = 0, we see that p/p(x) is a real polynomial. We see that little generality is lost in restricting our 
discussion of homogeneous polynomials to those with real coefficients. 

The two nontrivial examples of hyperbolic polynomials given in Garding's original paper, namely 
Lorentzian quadratics and the determinant, provide much of the intuition as to the meaning of hy- 
perbolicity. We now discuss these. A preliminary observation is that, trivially, all real homogeneous 
polynomials of degree one are hyperbolic in all directions in which they do not vanish. Next, observe 
that hyperbolicity of the homogeneous polynomial p in direction £ is preserved when p and £ are 
transformed by the same invertible real linear map. This allows us to classify all nondegenerate 
quadratics: it suffices to consider all polynomials of the form 'Y2j=i^ lX % hyperbolicity evidently 
being determined by signature. A necessary and sufficient condition for hyperbolicity is Lorentzian 
signature, that is precisely one sign different from the others. 

Example 2.3 (Lorentzian quadratics). Letp(x) := x\ — 2Z J= 2 x "j- R ea l vectors £ may be classified as 
time-like, light-like or space-like according to whether is respectively positive, zero or negative. 
The time-like vectors form the open convex cone {x : (xijx\) 2 + ••• + {xd/xi) 2 < 1}. Fix any 
time-like vector £. If r\ = A£ then the line 1 1— > r\ + i£ contains a doubled root of p at the origin. For 
any other real r\, the line r\ + tt; intersects the hyperplane orthogonal to £ at some point other 
than the origin and p takes a negative value there. On the other hand, the quadratic p(i] + i£) has 
positive leading term, so goes to +oo at t = ±oo. Hence the line intersects the zero set of p twice. 
The degree of p is two, hence for any r], the polynomial t i— > p(n + i£) has all real roots. Hence p 
is hyperbolic. Replacing p by —p does not affect hyperbolicity. Hence homogeneous quadratics with 
signature 1 or d — 1 are hyperbolic in timelike directions. 



For the converse, elliptic quadratics have lines in every direction with no real roots, hence are 
obviously not hyperbolic. If d > 4 and there are at least two positive and two negative directions, the 
any line on which p has a minimum may be translated in a positive direction not on the line so that 
the minimum becomes positive, and similarly any line on which p has a maximum may be translated 
so that the maximum becomes negative. We conclude there are no directions of hyperbolicity. 

The other classical example is as follows. 

Example 2.4 (Determinants of Hermitian matrices). Annxn matrix M is Hermitian if A'Uj = Mji. 
The space of Hermitian matrices of size n is parametrized by n 2 real parameters, these being the 
arbitrary complex numbers {.My : 1 < i < j < n} and the n real numbers {Ma : 1 < i < n}. Let 
p be the homogeneous polynomial on M™ defined by the determinant as a function of these n 2 real 
numbers. Hyperbolicity of p in the direction of any positive definite matrix A is equivalent to the 
well known fact that the zeros of M + zA are all real if M is Hermitian and A is positive definite. 
To see this, let z — a + bi with b > and write 

Det (M + zA) = Dct (M)Det \{bA)- 1/2 (aA + M){bA)- 1/2 + il 
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Figure 4: This Lorentzian quadratic is hyperbolic in the x-direction but not the y-direction. 

where (bA) 1 / 2 is the Hermitian positive definite square root of bA. If this quantity is equal to zero 
then —i is an eigenvalue of {bA)" 1 ^ 2 {aA + M){bA)~ 1 / 2 which is impossible because this matrix is 
Hermitian. 

There are several ways to construct new hyperbolic functions from old ones. Note that the first 
does not require homogeneity. 

Proposition 2.5 (products and polarization). 

(i) Let qi and q 2 be hyperbolic with respect to x. Then qiq 2 is also hyperbolic with respect to x. 

(ii) Let p be homogeneous of degree m and hyperbolic with respect to x. Let po,...,p m be the 
coefficients of p(y + fx) as a polynomial in t: 

m 

p(y + t X )=J2Pk(y)t k ■ (2.4) 

fc=0 

Then pk is hyperbolic with respect to x for all < k < m. 

Proof: The fact for products is immediate from the definition. For polarization, we first observe 
that hyperbolicity of p in direction x implies hyperbolicity of the directional derivative D[x][p] in 
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direction x. To see this, observe that f(t) having only real roots implies f'(t) has only real roots. 
Setting f(t) — p(y + tx), we see that f'(t), namely t H> £) x (p)(y + tx), has only real roots for any 
y G M. d , which is hyperbolicity of D x (p) in direction x. 

Now consider the expansion (|2.4p of p(y + tx) into homogeneous parts. The parts are given by 



Hyperbolicity in direction x is stable under D x , which shows that pk is hyperbolic in direction x. □ 
2.3 Cones of hyperbolicity for homogeneous polynomials 

Hyperbolic polynomials have associated with them certain convex cones. In the homogeneous case 
the definition is simple and is contained in the following proposition. This was proved first in |Gar51] 
and reproduced many times; the proof below follows Gi il97j . Let p be a (complex) homogeneous 
hyperbolic polynomial, hyperbolic in direction £. Dividing by a real multiple of we may assume, 
cf. (|2.3p . that p is real and p(£) = 1. 

Proposition 2.6. Let K(p,£) denote the connected component of the set M. d \ {x : p(x) — 0} that 
contains £. 

(i) p is hyperbolic in direction x for every x G K(p, £). 

(ii) The set K(p,^) is an open convex cone; we call this a cone of hyperbolicity for p. 

(Hi) K(p,£) is equal to the set K of vectors x for which all roots of t H> p(x + t£) are real and 
negative, which (by hyperbolicity of p in direction £,) is the same as the set of vectors x for 
which no root of this polynomial is real and nonnegative. 

Proof: By continuity of the roots of a polynomial with respect to its coefficients, the set K is open. 
Also £ G K because p(£ + t£) = (l+i) m p(0 where m is the degree of the homogeneous polynomial p. 
If x is in the closure K then, again by continuity of the roots, p(x + i£) ^ for t > 0; in particular, 
p(x) 7^ implies x G K, which means that K is closed in M. d \ {p = 0}. Also K is connected: by 
definition of K, if x G K then x + 1£ G K for t > 0, and by homogeneity sx + t£ G K for s, t > 0, 
and sending s to zero proves that K is star-convex at £ (hence connected). Being connected and 
both closed and open in M. d \ {p = 0}, K is a component of K d \ {p — 0} and is thus equal to K (p, £). 

Next we check that for v € if, x real, and s and t complex, 




p(x + tv + sf) ^ whenever 3(s) < 0, S(t) < 0, 3(s) + S(t) < . 



(2.5) 
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First, suppose 3 (s) < 0. If u > 1 then the polynomial 




(2.6) 



is nonvanishing for real t because this is p evaluated at the sum of a real vector and a complex 
multiple of £ with nonzero imaginary part. As u — > oo, the number of roots of (|2.6[) in the lower 
half- plane remains constant. The limit at u = oo is t > p(tv — ££). This has roots in the upper 
half- plane because its roots are — i divided by the roots of 1 1— > p(v + t£), the latter of which we have 
seen to be negative real. We conclude that for u — 1 there are no roots in the lower half-plane, in 
other words, 



To see that this also holds for 9(s) = and 3(t) < 0, note that K is open, so v — e£ £ K for some 
e > 0, whence 

p(x + tv + s£) = p(x + i(v - ef ) + (s + et)£) 
which is nonzero by (|2.7p . This completes the verification of (|2 . 5|) . 

Setting s = in (|2.5[) shows that any root of t h-> p(x + <v) satisfies > 0. Because p is real, 
complex conjugation may be applied to the entire argument, showing that also < 0, and hence 
that all roots of t h-> p(x + tv) are real. The vector v was chosen arbitrarily in K. We conclude that 
p is hyperbolic with respect to every v € K. It follows that K is star convex with respect to v as 
well, and since v G K is arbitrary, that K is convex. □ 

Because the cones of hyperbolicity are characterized as components of the nonzero set in real 
space, it follows that the cones of hyperbolicity of pq are pairwise intersections of the cones of 
hyperbolicity of p with the cones of hyperbolicity of q. 

Example 2.7 (coordinate planes). Each coordinate function Zj is homogeneous of degree 1, therefore 
hyperbolic in every direction not contained in the plane {zj — 0}. The cones of hyperbolicity are 
the two half spaces bounded by this plane. It follows that the product Ylj=i z j * s hyperbolic in every 
direction in which no coordinate vanishes, and that the cones of hyperbolicity are the orthants. This 
is also obvious from Figure [31 

Returning briefly to the origins of hyperbolicity theory in the properties of wave-like partial 
differential equations, the following result was proved in [ABG70, Theorem 4.1]. It says roughly 
that cones of hyperbolicity are propagation cones for solutions to wave-like equations. Given any 
convex cone K C K d over the origin, let K* denote the dual cone of vectors v* g (R d )* such that 
(v*,x) < for all x £ K. Recall the linear partial differential operator D[p] defined by substituting 
d/dxi for Xi in p. Let E = E(p, x) be a solution to 



p(x + tv + sf ) ^ whenever < 0, 3(i) < . 



(2.7) 



D[p]E = 5 X 



(2.8) 



where <5 X is a delta function at x. 
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Figure 5: The cones of hyperbolicity for z j are ^ ne orthants 

Theorem 2.8. Suppose the homogeneous polynomial p is hyperbolic in direction £. Then there 
exists a unique distribution E solving (|2.8p and its support is the cone K{p,^)* dual to the cone of 
hyperbolicity of p containing £. In fact E is given as in inverse Fourier transform: 

E(p^)^(2n)- d [ MyrV^dy 
where p-(y)~ 1 :— lim t ^op(y — itv) as distributions and v is any element of K(p,^) . □ 

Although there is no time here to explore the properties of hyperbolic polynomials as barrier 
functions, I will mention this example briefly, as it has proved to be of some importance to convex 
programming. 

Example 2.9 (self-concordant barrier functions). Let Q C M. d be an open convex set. A function 
F : Q — > M is a logarithmically homogeneous self- concordant barrier function for Q if it is 

smooth, convex, and satisfies several properties: 

F(x) — > oo flsx-> dQ 

|D 3 .F(x)[v,u,u]| < 2(D 2 F(x)[u,u}) 3/2 
| \DF(x)[u}\ 2 < 9D 2 F(x)[u,u] 
F(tx) = F(x)-61ogt. 

The interior point method for convex programming problems (see, e.g. [NNQSjj) is based on finding 
such a function for a given region, Q. There is a universal construction but it is not always useful 
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for computations. Also, properties beyond those satisfied by the universal construction are required 
for long term stability of the interior point method. Such properties will depend on the region Q. 
When Q is the cone of hyperbolicity for a homogeneous polynomial p, it turns out that the function 
F(x) := — logp(x) is a logarithmically homogeneous self-concordant barrier function with a number 
of other useful properties. These are detailed in JGul97l. 

Hyperbolicity without homogeneity 

Delving into the relation between hyperbolicity in the homogeneous and non-homogeneous cases, 
(definitions both given in Definition 11.11 we begin by looking at the homogeneous parts of a poly- 
nomial, /. Sorting the terms of / by total degree, the highest degree part will be denoted LT{f) 
for "leading term(s)" . The lowest degree is called the localization and will be discussed further in 
the next section. If / is hyperbolic, then both the leading term and the localization are hyperbolic 
as well (respectively [ABG70| Lemma 3.20] and [ABG70I Lemma 3.42]). By definition, the cones of 
hyperbolicity of a non-homogeneous polynomial / are the cones of hyperbolicity of LT(f). When 
/ is hyperbolic, these cones still characterize the directions of stable propagation of PDE's and 
supports of solutions to these. 

Because hyperbolicity is easier to understand in the homogeneous case, Garding looked for a con- 
verse to these and came up with a criterion for homogeneous polynomials called strong hyperbolicity. 
This is most naturally stated using the equivalent definition of hyperbolicity from Proposition 12.21 
the roots of p(y + ix) must not only be real but also distinct (except when y is a multiple of x, when 
all the zeros coincide perforce). We then have: 

Theorem 2.10. If A is strongly hyperbolic then f is hyperbolic for any f such that LT{f) = A. 

This is proved as (3.9) of [ABG7Q] and in fact is already in [Ga r51j after the statement and 
proof of Lemma 2.5. Necessary and sufficient conditions for / to be hyperbolic are given in [H6r83 ( 
Theorem 12.4.6]; these were conjectured by Garding and proved first by Svensson |Sve68j . 
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3 Semi-continuity and Morse deformations 



3.1 Localization 

Given a function / analytic on a neighborhood of the origin, its order of vanishing is the least 
total degree of a nonvanishing term in its Taylor series. The sum of all such terms is called the 
homogeneous part of / at the origin and denoted fihom(/). For any x we let 

6hom(/, x) := fihom(/(x + •)) 

denote the homogeneous part of / at x. When x ^ 0, Atiyah et al. |ABG70j define this by taking 
the t = term of t m f(t^ 1 x + y) as a function of y, where m — deg(f), and they refer to this as the 
localization of / at x. 

Proposition 3.1. Let p be any hyperbolic homogeneous polynomial, and let m be its degree. Fix 
x with p(x) = and let p := flhom(p,x) denote the leading homogeneous part of p at x. If p is 
hyperbolic in direction u then p is also hyperbolic in direction u. Consequently, if B is any cone of 
hyperbolicity for p then there is some cone of hyperbolicity for p containing B. 

PROOF: This follows from the conclusion (3.45) of ABG70, Lemma 3.42]. Because the devel- 
opment there is long and complicated, we give here a short, self-contained proof, provided by J. 
Borcea |BP111 Proposition 2.8]. If Q is a polynomial whose degree at zero is k, we may recover its 
leading homogeneous part fihom(Q) by 

13hom(Q)(y) - lim A fc <5(A _1 y) ■ 

A— >oo 

The limit is uniform as y varies over compact sets. Indeed, monomials of degree k are invariant 
under the scaling on the right-hand side, while monomials of degree fc + j scale by A -3 , uniformly 
over compact sets. 

Apply this with Q(-) = p(x + •) and y + tu in place of y to see that for fixed x, y and u, 

p(y + tu) = lim A fc p(x + A _1 (y + tu)) 

A— >QC 

uniformly as t varies over compact sub-intervals of R. Because p is hyperbolic in direction u, for 
any fixed A, all the zeros of this polynomial in t are real. Hurwitz' theorem on the continuity of 
zeros |Con781 Corollary 2.6] says that a limit, uniformly on bounded intervals, of polynomials having 
all real zeros will either have all real zeros or vanish identically. The limit p(y + tu) has degree k > 1; 
it does not vanish identically and therefore it has all real zeros. This shows p to be hyperbolic in 
direction u. □ 

Definition 3.2 (family of cones in the homogeneous case). Let p be a hyperbolic homogeneous 
polynomial and let B be a cone of hyperbolicity for p. If p(x) = 0, define 

K p ' B (x) 
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to be the cone of hyperbolicity of flhom(p,x) containing B, whose existence we have just proved. If 
p(x) / Ow define K p - B (x) to be all ofR d . 

Example 3.3 (strongly hyperbolic functions). As mentioned in passing in Section^ a homogeneous 
polynomial p is said to be strongly hyperbolic if the roots of p(y + tx), in addition to being real, are 
distinct. Equivalently, the projective variety V := {x : g(x) = 0} is smooth. In this case, for 
each nonzero x, the localization fihom(p,yi) has degree one. The cones of hyperbolicity K p ' B {x.) are 
half spaces whose common tangent hyperplane is tangent to B at x; the convexity of B implies that 
the tangent hyperplane is a support hyperplane to B at x, and we see that indeed K p ' B (x) contains 
B. 

Suppose x„ — > x. It is not in general true that 6hom(/, x„) — » Bhom(/, x). However, it is true 
that 

6hom(/, x„) = 6hom(fihom(/, x),x„) 

for n sufficiently large. This implies that if p is homogeneous and hyperbolic with cone of hyperbol- 
icity B, then K p < B {x) is semi-continuous in x: 

K p ' B (x) C liminf K p ' B (x n ) as x„ ->■ x. (3.1) 

This is proved in IABG701 Lemma 5.9]. We will want a version of this valid for polynomials that 
are not necessarily homogeneous. If q is hyperbolic but not homogeneous, the cone K q,B (x) has not 
been defined. To do this, at least for some points x, we need the notion of the amoeba of q. 

3.2 Amoeba boundaries 

Definition 3.4 (amoeba). The amoeba of any polynomial q is defined to be the image of the zero set 
V := {z : q = 0} of q under the coordinatewise log modulus map ReLog (z) := (log \zi\, . . . , log \zd\). 
We denote this image by flamoeba(q). 

The connected components of K d \ fiamoeba(q) are convex sets and are in one to one correspon- 
dence with the Laurent series expansions of F, with each expansion converging on precisely the 
set {exp(x + iy) : y e B} for some component B of \ fiamoeba(q). This is well known and is 
presented, for instance, in Chapter 6 of [GKZ94 . For those who have not seen an amoeba before, 
one of the simplest nontrivial amoebas is shown in Figure |5] 

The following result, proved in |BP111 Proposition 2.12], defines a family of cones via localizations 
on the boundary of an amoeba. 

Proposition 3.5. Let q be any polynomial and let B be a component of the complement of fiamoeba{q) 
Denote the boundary of B by dB. Fix x € dB and let f = qo exp so that f vanishes at some point 
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Figure 6: The amoeba of the polynomial 2 — x — y 

x + iy. Let f y := fihom{f,x + iy). Then each f y is hyperbolic (meaning hyperbolic in at least one 
direction) and one of its cones of hyperbolicity contains B. We denote this cone by K q ' B (y). (The 
point x is considered fixed and is suppressed from the notation.) □ 

One may extend semi- continuity for localizations beyond the homogeneous case, to the families 
in Proposition 13.51 

Theorem 3.6 ((BPTTJ Corollary 2.15]). If B is a component of fiamoeba(q) c and x g dB, then the 
family of cones K q,B (y) defined in Provosition VS . 5\ satisfies 

K q ' B (y) C liminf K q ' B {y n ) whenever y n y . (3.2) 

□ 

The fact that localizations of any polynomial to points on the amoeba boundary are hyperbolic 
allows us to give numerous examples fo hyperbolic polynomials beyond the classic ones: planes, 
quadrics, and the determinant function. For example, the homogeneous polynomial variety shown 
in Figure [7] is a localization of the famous so-called fortress generating function denominator. 

It is instructive to see what role hyperbolicity plays in Theorem 13.61 by considering the coun- 
terexample f(x, y, z) = xy + z 3 as in Figure [5] As y varies over a neighborhood U of the origin, is it 
possible to choose open convex cones {K (y) : y E U} over y in such a way that K(y) varies semi- 
continuously with y and each K(y) is, locally, a subset of {y : f{y) ^ 0}? The points y = (0, y,0) 
are forced to choose whether K(y) contains points with positive x components or negative x com- 
ponents. One of these violates semi-continuity with cones K(0, y, e) as e ! while the other violates 
semi-continuity with cones K(0,y, —e) as — e f 0. The key to avoiding this in Theorem 13.61 is that 
f y must be hyperbolic, and this cannot happen with hyperbolic functions. 

3.3 Morse deformations 

Suppose that K (y) is any family of convex cones in K d varying semi-conintuously with y in the 
sense of (|3.2j) . Let r be any vector such that each K(y) contains a vector v with r • v > 0. 
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Figure 7: Another hyperbolic homogeneous polynomial 



Proposition 3.7. Over any compact set S a section {v(y) : y G S} may be chosen continuously 
with v(y) > for all y G S. 

Proof: Given y, we may choose v in the interior of K(y) with v • r > 0, whence by semi- continuity, 
v G K(y') for all y' in some neighborhood of y which we denote J\f(y,v). We may cover the 
compact set S with finitely many of these neighborhoods, {Af(y^ k \-v^) : 1 < k < m}. Let 
: 1 < fc < m} be a partition of unity subordinate to this cover and for any y G S define 

m 

v(y) :=J2^ k) (y> (k) ■ 
fe=i 

It is clear that r • v(y) > for all y G S. For each y G S, if ^ k \y) > then vW G K(y). By 
convexity of K(y), it follows that v(y) G K (y). □ 

Two applications of this are as follows. Suppose p is a homogeneous hyperbolic function and 
let B be a cone of hyperbolicity for p. Due to homogeneity, K P ' B (Ay) = K p,B (y), whence the 
family may be described as {K p ' B (y) : y G S d ^ 1 }. Suppose r is such that each K{y) contains a 
v with r • v > 0. Because is compact, an application of Proposition 13.71 yields a continuous 

section {v(y) : y G S"^ 1 } with v(y) G K p ' B (y) and r • v(y) > 0. Extending to all of R d \ {0} by 
v(Ay) :— Av(y), we arrive at: 

Corollary 3.8. There is a vector field v(-) on KL d which is 1 -homogeneous, vanishes only at 0, is a 
section of K P ' B (-), and for some e > satisfies r • v(y) > e|y|. □ 
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Figure 8: The sheared cubic xy + z 3 



Example 3.9. If the homogeneous polynomial q is strongly hyperbolic then its zero set is the cone 
over a smooth projective hypersurface and each K(y) is a half space (see Examvle \3.3\) . The condition 
that every K(y) contain a vector v with r ■ v > is the same as requiring that r not be the outward 
normal to the bounding hyperplane of K{y). In other words, r may not be on the boundary of the 
dual cone to B. 

The second application is when B is a component of 8amoeba(q) c . In this case the family 
{/ y } is periodic with period 2tt in each coordinate. Accordingly, the cones K{y) = K q ' B {y) from 
Proposition l3.5l are indexed by a compact set, namely the torus R d /(2-7rZ) d . Applying Proposition l3.7l 
then yields: 

Corollary 3.10. If each cone K q ' B {y) in Proposition Iff. 51 contains a vector v with r • v > then 
there is a continuously varying section {v(y) : y G M. d } with v(y) € K q,B (y) and r • v(y) > for 
all y. □ 

With these vector fields in hand we have enough to carry out the programs of AB G70] and jBPllj . 
I will briefly describe the former, then go into a little more detail on the latter. Let q be a homo- 
geneous polynomial, strongly hyperbolic in the direction x. We wish to compute its inverse Fourier 
transform, which we have called the Riesz kernel, Q a . We recall that its value at y is given by 




— a 



exp(y ■ z) dz . 



{UP 
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We set r = — y and assume r not to be on the boundary of B* , the dual cone of the cone B of 
hyperbolicity of q that contains x. As we have seen in Example 13.91 this guarantees the existence of 
the 1-homogeneous vector field v(-). The sets K(y) are convex and each contains both x and v(y). 
Therefore, each contains the line segment joining x to v(y). Define a homotopy by 

H{u, t) := iu + (1 - t)x + fv(u) . 

This deforms the domain of integration x + iM. d in (|2.2[) to the cone C := {iu + v(u) : u 6 R d } while 
avoiding the set V where q vanishes. Cauchy's theorem implies that deforming the contour does not 
change the integral and therefore that 

Qa(y) = J q(z)~ a cxp(y • z) dz . 

Omitting pages of detail and skipping to the punch line, this representation allows us to factor the 
integral. Integrating radially reduces the integral to an integral over the leray cycle, which is a 
(d— l)-dimcnsional homology class in the complement of {q — 0} in CV d ~ 1 . The dimension reduction 
allows a number of explicit computations which are carried out in [ABG70, Section 7]. 



3.4 Asymptotics of Taylor coefficients 

Let F = P/Q = ^ r a r Z r be a Laurent series expansion converging on the component B of 
6amoeba(Q) c - The coefficients {a r } of F may be recovered via Cauchy's formula 

f d7 

a r = (27rz)- d ^Z- r F(Z)— (3.3) 

where T is the torus T(x) = {exp(x + iy) : y e R d /(27rZ) d } for some x e B and dZ/Z := dzi A 
• ■ ■ A dzd/ {z\ ■ ■ ■ Zd). The motivation for the evaluation or asymptotic estimation of the coefficients 
{a r } comes from analytic combinatorics, where the primary object of study is the array {a r }, which 
counts something of interest. One constructs the generating function F(Z) := ^ r a r Z r and hopes to 
identify a closed form representation of F. If {a r } satisfy a recursion, then (depending on boundary 
values) this will usually succeed. A number of examples of combinatorial interest are surveyed 
in |P W08j . In most of the cases surveyed there, the generating function F is rational. In (RW08 , 
based on a result of jSafOOj . it is shown how to embed any algebraic function of d variables as 
a diagonal of a rational function of (d + 1) variables. Thus the evaluation of Taylor coefficients 
of rational functions solves the enumeration problem for any array of numbers whose generating 
function is rational or algebraic. 

Exact evaluation of the Cauchy integral (|3.3I) is not easy but some methods are known for 
evaluating it asymptotically. In the case where the pole variety V := {z : Q(z) = 0} is smooth, a 
formula was given in |PW02j ; a coordinate free version is given in [BBBP08 . Normal self-intersection 
in V can also be dealt with }PW04) . A number of examples from combinatorics and statistical physics 
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have generating functions whose pole variety has a singularity with nontrivial monodromy. In the 
remainder of this section I will explain how hyperbolicity and the resulting deformations allow 
explicit asymptotic evaluation of some of these generating functions. 

So as to keep the conversation more concrete, we consider as a running example the generating 
function for the probability of a Northgoing diamond in a uniform random tiling of the Aztec 
Diamond. This example is taken from |BP11[ Section 4]. We have 

F ' Z) = (l-(X + X-i + Y + Y~i)Z + Z 2 )(l - YZ) = £ a ^ XTySZt 

where a rst is the probability that the domino covering the square (r, s) in the order t Aztec Diamond 
is oriented in a Northgoing direction. Figure I3T41 illustrates the Aztec Diamond shape of order 4 and 
the macroscopic features of a random tiling by dominoes of a larger Aztec Diamond (order 47). 




Figure 9: An Aztec diamond and a random tiling of another Aztec diamond 



Observe that r and s can be both positive and negative, while t is always positive and at least 
\r\ + \s\. The denominator Q(Z) is a Laurent polynomial whose zero set in (C*) d has two isolated 
singularities at ±(1, 1, 1). Near each of these, the zero sets of the two factors in the denominator 
look like a cone and a plane respectively: letting Q = JH, where J is the quadratic and H = 1 — YZ 
is the log-linear term, we have 

6hom(Q, (1, 1, 1)) = (Z 2 - ±(U 2 + V 2 ))(Y - Z) . 
The zero set of this homogeneous polynomial is shown in Figure 1101 

Let V denote the flat torus x + R d /(2?rZ) d where x is any element of B, the component of 
8amoeba(Q) c over which the series converges. Changing variables via Z = exp(z) = (e zi , e Z2 , e Z3 ) 
and dz, — tfZ/Z, then writing z = x + iy and /(•) — F o exp(x + i-), yields 

a r = (27r)- d e- r ' x J expHr • y)/(y) dy . (3.4) 
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Figure 10: The homogenized zero variety near (1,1,1) 



Up to this point the expression for a r is exact. One can show that approximating / by 6hom(/) and 
T" by R d does not change the leading asymptotic term. This is a somewhat lengthy verification and 
it is here that the conical deformation of Corollary 13. 101 is required. The upshot is that a r — E(r) 
where E is the inverse Fourier transform of fihom(/). For quadratics, the inverse Fourier transform 
is the dual quadratic. For a product of a quadratic and a log-linear function, an explicit computation 
yields jBPlll Theorem 3.9] 



where A* is the quadratic form dual to fihom(/) and I is the the log-linear factor viewed as an 
element of (R d )* in the logarithmic space. For example, when F is the Aztec generating function 
one obtains the following result, a pictorial version of which is given in Figure [TT1 

Theorem 3.11 ( BP11, Theorem 3.7]). The Northgoing placement probabilities {a rs t} for the Aztec 
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Figure 11: Scaling limit of a rjS 
Diamond are asymptotically given by 



1 / V< 2 - 2r 2 - 2s 



2 



o-rst ~ — arctan 



7T V f-2s / 

w/ien (r, s, t) is in the cone {(r, s, t) : t 2 > 2r 2 + 2s 2 } and r + s + £ is odd. 

Remark. The way the generating function is indexed, a rst = when r + s + 1 is even. Asymptotics 
in the corners of the diamond outside the inscribed circle are given by a rst — > 1 in one corner and 
a rst — > in the other three, where convergence occurs at an exponential rate as t — > oo. 
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Part II: Stability 



We now turn to the notion of stability. As we will see in Sections 14.31 and [5J the most useful 
properties of stability are closure properties: stability is preserved by a wealth of operations that are 
natural from an algebraic or probabilistic point of view. Stability is defined for general multivariate 
complex polynomials. Some of these properties or their proofs simplify when restricted to certain 
subclasses, such as polynomials whose coefficients are real or positive, or multi-affine polynomials, 
whose degrees in each variable never exceed 1. Even in these cases, however, some properties are 
proved only by going through the more general setting of complex coefficients. 

4 Stability theory in one variable 

As usual, the univariate theory is older and simpler. As with hyperbolicity, the origins of stability 
theory are in differential equations and control theory. After discussing this, we turn to combinatorial 
uses of the univariate theory. Stable generating polynomials produce coefficient sequences satisfying 
Newton's inequalities, implying, among other things, log-concavity. We end the section on univariate 
stability with a discussion of stability preserving operations and the so-called Laguerre-Polya class. 

4.1 Stability over general regions 

In this section we will begin by thinking more generally of polynomials that avoid roots in some 
region Q. We will call these ^-stable. Throughout the remainder of this paper we will let 

H := {z : > 0}. 

denote the open upper half-plane. I will always use the term "stable" to mean "H-stable, as in 
Definition 11.21 and will use "Hurwitz stable" or "fi-stable" for other notions of stability. Recalling 
G.-C. Rota's philosophical observation, one might keep in mind that the Riemann Hypothesis is 
equivalent to Q-stability where ft — {z : 3?{z} ^ 1/2}. This may seem like a strained connection, 
but in fact some of the literature we review in Section [7] is explicitly motivated by the desire to 
understand the zeta function. 
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Statistical physicists have a different motive for understanding regions free of zeros. To explain 
this, we examine some graph theoretic models, paraphrasing the exposition in [S okOlj . Let G = 
(V, E) be a finite graph and let q be a positive integer. Define a polynomial in variables {x e : e G E} 

Z(q,{x e : e G E}) Yl i 1 + x ^a(v),a(w)) 

a e—vw(EE 

where a ranges over g-colorings of the vertices, that is, all maps from V to This is 

elementarily seen to be equivalent to the alternative definition 

Z(q,{x e :eeE}):= £ q k ^ J] x e 

E'CE eeE' 

where k(E') denotes the number of connected components in the subgraph (V, E'). The last for- 
mulation makes it obvious that Z is a polynomial in q as well as in {x e }. When x e = — 1 for all e 
this specializes to the chromatic polynomial x(l)> more generally, taking x e = x for all e gives the 
bivariate Tutte polynomial. 

The polynomial Z is also the partition function for the g-state Potts model. Statistical physi- 
cists are interested in phase transitions where the behavior of the system depends non-analytically 
on its parameters. Being a polynomial, Z is of course analytic in all its parameters. Typically though, 
one is interested in infinite-volume limits such as the free energy 

f(q,x):= Urn n' 1 log Z n (q, x) 

n— >oc 

where Z n is taken on a graph G n in a family of graphs whose increasing limit is Z d . The function 
/ may fail to be analytic at (q,x) if the zeros of Z n (q,x) have a limit point at (q,x). Therefore, 
there is physical significance in keeping the zeros of the polynomial Z n out of specified regions. Such 
results are often known as Lee- Yang theorems, after |YL 52, LY52 s (see also (LS81 ). 

One such theorem was proved by D. Wagner in 2000. The reliability polynomial Rq of the 

graph G = (V, E) is the probability that the graph is connected when each edge is kept or deleted 
independently with probability x, which is a polynomial in the parameter x. Brown and Colbourn 
conjectured [BC92 that all zeros of Rg{%) are in the closed disk {z : \z — 1| < 1}. In other words, 
Rq is 2? c -stable where 2?:={z:|z— 1|<1}. This was proved for the class of series-parallel graphs 
in |Wag00| Theorem 0.2]. A simpler proof was found by Sokal [SokOll Section 4.1, Remark 3], who 
actually proved the stronger multivariate stability result. On the other hand, Royle and Sokal RS01 
showed that the original univariate conjecture is false for general graphs; they also showed that 
multivariate P c -stability of Rq holds if and only if G is series-parallel. Sokal [Sokllj has recently 
conjectured that Rq is (univariate) 2? c -stable for the complete graphs, K n . 

Hurwitz stability 

The association of zero-free regions with the term "stability" originated in ODE's and control theory. 
Its use is attributed to Hurwitz. In fact, ^-stability with Q = {z : $l{z} > 0} is sometimes called 
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"Hurwitz stability" . In ODE's, it is easy to see the physical significance of Hurwitz stability. Let M 
be a matrix and consider the linear system y' = My. Hurwitz stability is equivalent to all eigenvalues 
of M having negative real parts, which is equivalent to all homogeneous solutions decaying, and hence 
good control over the system y' — My = v(i). One may also consider discrete-time analogues, such 
as the system y(™ +1 ) — Qy^ = where Q would correspond to the exponential of M in the 
previous system. Now decay of homogeneous solutions is equivalent to ^-stability when Q is the 
complement of the open unit disk; polynomials whose zeros are all in the open unit disk are said to 
be Schur-stable. 

For a less trivial example we turn to control theory. Following !Hen91] Section 10.3], we consider 
systems which turn an input signal f(t) into an output u(t). Many systems, and in particular those 
built from networks of impedances, are not only linear but also time homogeneous, that is, the map 
/ i-> u commutes with time translation. In terms of Laplace transforms, this means that the system 
acts multiplicatively, meaning that if C denotes the Laplace transform, then such systems obey the 
law 

Cu = g-Cf (4.1) 

for some function g : [0, oo) — > K that is called the transfer function. For example an L-C-R circuit, 
with an input voltage function f(t) between the inductor and the resistor producing an output u(t) 
of current from the inductor to the capacitor, satisfies (|4.1[) with g(s) = (R + Ls + C _1 /s) _1 . 
In impedance networks the transfer function is in fact always rational, g(s) = p(s)/q(s) for some 
polynomials p and q. Generally, a system is said to be stable if bounded input produces bounded 
output. 

To check whether the system with transfer function g is stable, it suffices to test it on inputs 
of the form f(t) = e lw *, the Laplace transform of which is (£/)(s) — l/(s — iuj). Let the poles of 
the rational function g be a\, . . . , a r . The Cu = g ■ Cf has poles a\, . . . , a r , iui. We may write Cu 
in a partial fraction expansion resulting in X^=o-P://( s — a j) f° r some polynomials {pj\; here we 
have set ao := icj. Inverting the Laplace transform gives a sum Pj(t)e ajt for some collection 

{Pj} of polynomials. This is bounded if and only if each aj is either in the open left half-plane or is 
imaginary and a simple zero. But when g has an imaginary zero aj, then setting ao = aj (that is, 
taking the input to be e lajt ) produces a doubled root and an unbounded output. Hence, bounded 
inputs produce bounded outputs if and only if all poles of g lie strictly in the left half-plane. 

We conclude that stable behavior of the system corresponds to Hurwitz stability of the denomi- 
nator of the transfer function g. For example, in the L-C-R circuit above, the denominator of g is a 
quadratic with positive coefficients. The real parts of the roots are always negative, whence such a 
system always behaves stably. 

Differential equations of order d may be transformed into first order systems in d variables by 
the well known trick of representing the first d — 1 derivatives as new variables. It is not surprising, 
therefore, that Hurwitz stability also arises in the stability analysis of order-d linear differential 
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equation with constant coefficients. Given d initial conditions and an inhomogeneous term, we may 
write such an equation as 

f W +a d ^f^ + --- + a f = H (4.2) 

with initial conditions /^'(O) = bj for < j < d — 1. Assuming H to grow at most exponentially, a 
Laplace transform exists near the origin. We may take the Laplace transfrom of both sides of (|4.2f) . 
Using linearity and the rule Cf'(s) — sCf(s) — f(0 + ) and denoting g := Cf gives inductively 

£(/W)(s) = s d g(s) - s d - 1 f(0 + ) - S d - 2 f'(0+) / (d - 1} (0 + ) • 

Plugging this and the boundary conditions into (|4.2p yields the following equation for g: 

s d g( S ) - s^bo - s d - 2 h b d -! 

+ a d -i [a'-^OO " sd " 2fo o 6d- 2 ] 



+ a g(s) = CH(s) . 

Letting p(s) := s d + ad_i,s d_1 + • • • + ao denote the characteristic polynomial of the equation (|4.2p . 
and Pk(s) denote the shifted polynomial s + ad-is k ^ 1 + ■ ■ ■ + CLd-k, w e may rewrite the equation 
for g as 

d-l 

p- g = CH + b jPd-i-j 
j=o 

and hence 

£i?(s)+E3=d^W-i-j( s ) „s 
= f-r^ ■ (4.3) 

Again, poles of 5 with positive real part produce unbounded output, as do purely imaginary poles 
iu> when the driving term is taken to be e 1 "*, whose Laplace transform CH has a pole at iuj. Thus, 
again, Hurwitz stability is equivalent to bounded output on bounded input. 



4.2 Real roots and Newton's inequalities 

I will now return, permanently, to upper half-plane stability, which will be called, simply, "stability" . 
In one variable, when the coefficients of / are real, the zeros come in conjugate pairs. Stability, 
therefore, is equivalent to having only real zeros. When the coefficients are nonnegative, a strictly 
positive zero is impossible, whence stability is further equivalent to having all zeros on the negative 
half line. 

Combinatorialists have long sought to exploit the properties of stable generating functions. The 
universal problem in combinatorial enumeration is to count a family of structures indexed by one or 
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more positive integer parameters. In the case of one parameter, say n, the sequence {a n } of counts 
and the corresponding generating function f(z) := ^2 n a n z n are of fundamental interest. In the 
setting of probability generating functions, the coefficients a n are nonnegative and sum to one. In 
this case, if / has only real roots, then the distribution which gives probability a n to the value n is 
representable as the sum of independent random variables each taking the value or 1 (Bernoulli 
random variables). There may or may not be a natural interpretation for these Bernoulli variables; 
see [HKPV091 Example 23] for an example in which there can be no natural interpretation. 

Often it is intuitively plausible that the sequence {a n } is unimodal, meaning that for some 
k, we have ao < ■ ■ ■ < > a^+i > > • • • ■ The enumeration literature is littered with 

examples in which unimodality is conjectured (see for example the survey [Sta89j and the follow- 
up to this |Bre 94 ), but a proof is often elusive for the reason that there is no obvious theoretical 
framework within which to prove unimodality. There are, however some stronger properties for 
which natural avenues of proof exist. 

Definition 4.1 (log-concavity). A finite or infinite sequence {ak} of nonnegative numbers is said 
to have no internal zeros if the indices of the nonzero terms form an interval [r, s] . The sequence is 
said to be log- concave if it has no internal zeros and if 



for k G [r + 1, s — 1]. The sequence {a r , . . . , o r +fc} with no internal zeros is said to be ultra log- 
concave if 



It is immediate that ultra log-concavity implies log-concavity which implies unimodality. While 
ultra log-concavity appears to be the least natural of these properties, it was shown three centuries 
ago by Newton to follow from stability of the generating function. 

Theorem 4.2 (Newton's inequalities). Suppose f(z) := X)fc=o afe2;fc * s a rea ^ s ^ a ^ e polynomial. 
Then for 1 < k < n — 1, 



If furthermore the coefficients ak are all nonnegative then the roots of f are all nonpositive and the 
sequence is ultra log-concave. 

Proof: By Rolle's Theorem, if a univarite polynomial / has only real zeros then so does its 
derivative. This observation will be useful many times below. We use it now to deduce that 





(4.4) 
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has only real zeros. Reversing a sequence of coefficients via R(z) := z n ~ k+1 Q{l/z) also preserves 
the property of having all real roots. By Rolle's Theorem again, S{z) := (d/ dz) n ~ k+1 R(z) has all 
real roots. But S(z) is the trinomial 

and the theorem follows from the discriminant test for quadratics. □ 

A curious result in a similar vein was proved by Gurvits |Gur08[ Lemma 3.2]; the proof, which 
is a few lines of calculus, is omitted. 

Proposition 4.3. Let f be a probability generating polynomial of degree d and let C := inf f(t)/t. 
If f is stable then 

„ 1 = m >(Vf ' c - 

Equality holds if and only if f = (1 — q + qt) d generates a binomial distribution. The value [(d — 
increases to e _1 as d — > oo; for infinite series, the bound a\ > e~ 1 C holds with equality if 
and only if fit) = e^*^ 1 ) is the generating function for a Poisson distribution. □ 



There are a number of techniques that can be used to establish stability (also known, in the 
univariate case, as the real root property). In some cases one has a sequence {f n } where f n has 
degree n and the roots of f n —i interlace the roots of /„, meaning that each interval between roots 
of f n contains a root of f n ~i- If this is true, then often it is possible to prove it by induction. The 
following example from |Sta89) illustrates this. 



Example 4.4 (Hcrmitc polynomials). The Hermite polynomials are a sequence of polynomials de- 
fined by 

" {-l) k n\{2zY 



Hn{z) := Y, 



2 

The are orthogonal with respect to the Gaussian measure e~ x dx. They satisfy the recursion 

d 



H n (z) = -e z2 — (e-*X_i(z)) 



Assume for induction that H n —i(z) has n — 1 real zeros. It is clear from this that H n has n — 2 
zeros interlacing the zeros of H n -\. To see that H n also has a zero less than all the zeros of H n -\, 
observe that e~ z £f„_i tends to zero as z — > — oo, therefore e~ z i? n -i has an extreme value to the 
left of its leftmost zero, which is a zero of H n . Similarly, H n has a zero greater than the greatest 
zero of H n -i, and the induction is established. 



There is a more methodical way that often works to prove interlacing by induction. If / and 
g are real polynomials of respective degrees n and n — 1 having all real roots and the zeros of g 
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interlace the zeros of /, it may be seen elementarily that f + Xg has n real zeros for any real A. A 
simple application of this idea is to the sequence {g n } defined by the three-term recurrence 

g n +i{x) = axg n (x) + bg n -i{x) , 

which include the Chebyshev polynomials and Laguerre polynomials (a and b depending on n in 
the latter case). The same idea is the basis for a theorem independently proved by Heilmann and 
Lieb |HL72j . by Gruber and Kunz |GK71j and, in part, by Nijenhuis |Nij76| . A matching on a finite 
weighted graph or multi-graph is a subset M of the edge set such that any two edges in M are 
disjoint (share no vertex). Fix a graph G — (V, E) and let {W(e) : e £ E} be a set of nonnegative 
weights. Let aj denote the number of matchings with j edges, counted by weight, where the weight 
of a matching is the product of the weights of the edges in the matching; by convention we take 
ao : —1. In the simplest case, W(e) = 1 and aj simply counts matchings of size j, size being the 
number of edges. 

If we are allowed to set W(e) = for some edges, then the complete graph is universal. We 
therefore assume that G = K n for some n. 

Theorem 4.5 (matchings). The number of weighted matchings of K n enumerated by size is ultra 
log-concave. 

Proof: We show stability of a certain generating function, the most convenient being 

Ln/2j 

Q n (z):= £(-l)V n_!y - ( 4 -5) 

This is monic, of degree n, and is odd or even depending on n. If we show Q n is stable and has n 
distinct roots, then letting Q n {z) = R n (z 2 ) in the even case and Q n (z) = zR n (z 2 ) in the odd case, it 
follows that R n (—z) has [n/2\ negative real roots, hence is stable; it is also the generating function 
for matchings by size, hence the proof will be complete. 

Taking limits at the end, we may assume without loss of generality that the weights VF(e) are 
all strictly positive. To see that Q n is stable, let denote the generating function defined by (|4.5|) 
on the graph K n \ S. We use the recursion 



which is obvious from the definition. The induction hypothesis is that Q^ L is stable with distinct 
roots, which are interlaced by the roots of Q™' v for any 

To verify the induction, we consider the sign of Q n at the n — 1 zeros of QJJ in decreasing 
order. By the interlacing property, the sign of each Q^ v alternates, starting out positive because 
Qn' v (+oo) = +oo. Therefore, zQ^ — J2 V W(u, v)Q% v alternates, starting out negative. This implies 




(4.6) 
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the existence of n — 2 roots of Q n interlaced by the roots of Q^- But also there is one root of Q n 
to the right of every root of because Q n is negative at the rightmost root of and positive at 
+00. Similarly there is one root of Q n to the left of every root of This completes the inductive 
proof. □ 

A second method by which one can establish univariate stability is via closure properties of the 
class of univariate stable polynomials. This topic will be expanded in the next section, but for now 
we mention a result from [Brc89j, whose proof we will omit. Let (x)i denote the falling product 
x(x — 1) • • • (x — i + 1) and let (x) 1 denote the rising product x(x + 1) • • • (x + i — 1). 

Theorem 4.6 ( IhcS!)] Theorems 2.4.2-2.4.3]). Let f(x) := Y^k=o a k x>t be real stable with non- 
negative coefficients. Then the polynomials X)fe=o afc ( a ') fc an ^ Sfe=o afe ( a; ) fe are rea ^ stable as well. 
□ 



In the following example, a function from the set {1, . . . ,n} to itself is represented as a directed 
graph with an edge from j to f(j) for each j. 

Example 4.7 (functions enumerated by components). Let b(n,k) be the number of functions from 
the set {1, ...,n} to itself whose directed graph has precisely k components. Lt is elementary, 
cf. fGJM Example 3.3.28], that 



i=i v 7 



n n ~ l c{i,k) 



where c(i, k) is a signless Stirling number of the first kind. Multiplying by x k and summing over k 
gives 



/ 1 \ 

f n (x) := b (n, k)x k =£ , n n -\xf 



k=l i=l x ' 

The corresponding sum replacing (x) 1 by x l has the short closed form 

i ( n _ x ) n " = x<kX + n ^ ■ 

i=l \ l _ ' 

Evidently this has only real zeros, so applying the conclusion of Theorem \4-.6\ we see that f n is stable 
as well. 



A third method is an equivalent characterization for ultra log-concave that is due to Edrei |Edr53| . 
This relies on the notion of total positivity, a theme developed at length by Karlin Kar68] and 
for which a number of combinatorial applications are given in |Bre95j . 

Definition 4.8 (Total positivity; Polya frequency sequence). 

(i) An infinite matrix is said to be totally positive (TP) if all of its minors are nonnegative. 
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(ii) A sequence {a n } is said to be a Polya frequency sequence (PF-sequence) if the matrix A n k :— 
a„_fc is totally positive. Here we take Oj :— if i < or the sequence {a n } is finite and has length 
less than i. 

A proof of the following equivalence may be found in in |Kar68[ Theorem 5.3]. 

Theorem 4.9 (Edrei's equivalence theorem). The nonnegative sequence (do, . . . , ad) is a PF-sequence 
if and only if the generating polynomial X)fc=o a k zk is stable. □ 

Brenti |Bre94[ Section 3] points out that only the 2x2 minors are required for log-concavity, 
hence unimodality. Nevertheless, nonnegativity of all minors has a combinatorial interpretation 
which is given in |Bre95| Theorem 3.5]. This interpretation is somewhat abstract, but in special 
cases the interpretation can be more concrete. 

Example 4.10 (r-derangements) . A permutation o/{l, . . . , n} is said to be an r-derangement if all 
its cycles have length at least r. Thus a 1-derangement is any permutation, a 2-derangement is a 
classical derangement, and so forth. Let b(n,r) count the number of r-derangements of {1, ... ,n}. 
Brenti [Bre95, Corollary 5.9] shows that f n :— ^2 r= ib(n,r)z r is stable. To do so, he shows that 
a related sequence {c r (n)} is a PF-sequence, hence stable, then applies Theorem \4-6\ to recover the 
result with b(n,r) in place of c r (n). 

We end this section with one more example of a stable generating polynomial, this one taken 
from [Sta89j . Recall from Example 12.41 that if A is a real symmetric matrix and B is real positive 
semi-definite then f(z) := Det (A + zB) is a real stable polynomial. 

Example 4.11 (spanning forests enumerated by component). Let G be a finite graph with possibly 
multiple edges and let 

a k (G) :=5>(^) 

F 

where the sum is over all spanning forests F of G such that F has precisely k components, where 
"f{F) is the product of the cardinalities (numer of vertices) of the k components. The factor 0/7 
changes the eumeration from forests to rooted forests. Following 'Sta89, Proposition 4], let us see 
that the polynomial fa '■= ^k a k{G)z k is stable. Let A be the matrix whose rows and columns are 
indexed by the vertices of G with entry Ajj := deg(u) if u = v and otherwise equal to —N(i,j) where 
N(i,j) is the number of edges between i and j . Lt is known that fa = Det (A + zl). It follows that 
fa is stable. Therefore, the number of rooted forests enumerated by components is ultra log-concave, 
hence unimodal. Stanley attributes the identity fa = Det (A + zl) to Kelmans. 

4.3 The Laguerre— Polya class 

Suppose we transform a polynomial f(z) = X)l-=i a kZ k by multiplying each coefficient a k by a 
specified constant We may ask which sequences A := {A^} arc multiplier sequences, meaning 
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that the resulting operator T\ preserves the class of real stable polynomials. A classical theorem due 
to Polya and Schur |PS14j gives a complete and beautiful characterization of multiplier sequences. 

Theorem 4.12 (Polya-Schur 1914). Let A = {A„ : n > 0} be a sequence of real numbers and let T\ 
denote the linear operator defined by 

(n \ n 

k=0 J k=0 

Denote by $ the formal power series 

oo . 
k=0 

Then the following are equivalent, 
(i) A is a multiplier sequence; 

(ii) $ is an entire function and is the limit, uniformly on compact sets, of the polynomials with all 
zeros real and of the same sign; 

(Hi) $ is entire and either <fr(z) or z) has a representation 

oo 

Cz n e aoz Y[(l + a k z) 

k=l 

where n is a nonnegative integer, C is real, and ctk are real, nonnegative and summable. 

(iv) For all nonnegative integers n, the polynomial T\[(l + z) n ] is real stable with all roots of the 
same sign. 

We will not prove the general Polya-Schur theorem here, proving only some special cases later as 
we need them. Some examples and remarks will clarify its meaning and possible uses. Condition (iv) 
may be thought of as saying that the polynomials (1 + z) n are universal test cases for stability 
preserving: a multiplier sequences preserving stability of these will preserve stability of all real 
stable polynomials. 

Example 4.13 (dilation). If b is a nonnegative integer, setting A& :— b k produces the operator 
T\f{z) — f(bz). Clearly this preserves stability. The representation in (Hi) is obvious because 
$0) = e bz . 

Example 4.14 (factorials). If n > 1 is an integer then the sequence \k '■= (n)k '■= n\/(n — k)\, 
defined to vanish when k > n, produces the exponential generating function $(z) = (1 + z) n . By 
criterion (Hi) this is a multiplier sequence. Dividing by the constant n\ we see that A/; = l/(n— fc)! 
for k < n and zero for k > n defines a multiplier sequence. For real polynomials of degree n, stability 
of f is equivalent to stability of the inversion z n f(l/z), hence Afe = l/k\ defines a multiplier sequence 
on polynomials of degree n. This is true for every n, whence {l/fc!}^L * s a multiplier sequence. 
This result is due to Laguerre. 
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Example 4.15 (coefficientwise multiplication). Applying the previous example to a polynomial 
9( z ) = Sfc=o a kZ k with negative real roots shows that X)fe=o( a fc/^O zfe a ^ so has negative real roots. 
Setting X k := a k for k < n and zero for k > n we see that its exponential generating function 
$(z) = X]fc=o a kZk/k\ is of the form in {Hi) with Qt$ — 0. We conclude that {a k } is a multiplier 
sequence. In other words, if we multiply term by term the coefficient sequences of two real rooted 
polynomials, at least one of which has nonnegative coefficients, we get another real rooted coefficient 
sequence. Thus the real root property for polynomials with nonnegative coefficients is closed under 
Hadamard products. 



Multiplier sequences are related to the so-called Laguerre Polya class. An entire function is 
said to be in the Laguerre-Polya class, denoted C— V, if it is the limit, uniformly on compact subsets 
of C, of polynomials with only real zeros; the terminology goes back at least to |Sch47| . It is said 
to belong to the subclass L—Vl if it is the limit of polynomials with only real zeros all of the same 
sign. Thus Theorem 14.121 asserts that generating functions of multiplier sequences are precisely the 
functions of class L—Vl. 

A related notion to that of a multiplier sequence is the notion of a complex zero decreasing 
sequence (CZDS). Let Z c (f) denote the number of non-real zeros of /. Say that a finite or infinite 
sequence {a k } is a CZDS if for any real polynomial f{z) = X)fc=o D kZ k , 

Z c (j2a k b k z k \<Z c (f). (4.7) 

\fe=o / 

In particular, in order for {a k } to be a CZDS, a value of zero on the right of (14.7[) (no non-real zeros) 
implies no non-real zeros on the left, so any CZDS is a multiplier sequence. The converse, however, 
is not true. 

To see that there is any nontrivial CZDS, we observe that the operator z(d/dz) is represented 
by the sequence a k '■= k and can never increase the number of non-real zeros. Although not every 
multiplier sequences is a CZDS, each multiplier sequence leads to a CZDS via the following result 
going back to Laguerre. 

Proposition 4.16. Let $ G L— V have zeros only in (— oo, 0]. Then the sequence {^(k) : k > 0} is 
a CZDS. □ 



Related to the notions of multiplier sequences and CZDS is the notion of a multiplier sequence 
for the property of being nonnegative on real inputs. Say that {c k } is a A-sequence if multiplication 
of coefficients term by term preserves the property of being everywhere nonnegative: 

n n 

a k x k > for all x e R c k a k x k > for all x 6 K . 

k=Q k=0 

Note that negative values are allowed for the numbers c k . The following classical relationship is 
proved in |Wid41j . 
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Proposition 4.17. If {a^} is a CZDS then {a^ 1 } is a A -sequence. □ 

Connections abound between these notions: stability, multiplier sequences, the Laguerre-Polya 
class, A-sequences, etc., which we will not have time to survey here; the reader is referred to [CC95] 
for an introduction. A-sequences, for example, are neatly characterized by a determinant condition 
and also by the so-called Hamburger moment problem: {c„} is a A-sequence if and only if c„ = 
t n dn(t) for some measure /x on R not supported on finitely many points. At the root of much 
of the present interest about these properties is their connection to the Riemann hypothesis. An 
account is given in |CV90[ ICso03j . Finally, a small related literature on nonlinear transformations 
that preserve real stable polynomials may be of interest. It was conjectured independently by S. Fisk 
and R. Stanley that if the real polynomial ^ fc a,kz k has all real roots then so does the polynomial 
whose z k coefficient is a\ — at-icik+i- Some progress was made by McNamara and Sagan in MS10 ; 
a proof of the conjecture and more was recently found by Brandon [Brail] . 



5 Multivariate stability 

Let % denote the open upper half-plane; thus stability is equivalent to having no zeros in the region 
H • We remark on some connections to other notions of stability. For homogeneous polynomials, the 
zero set is invariant under multiplication by e 10 in each coordinate, therefore all notions of half-plane 
stability coincide. Any circular region (the interior or exterior of a circle) is transformed into % by a 
Mobius transformation, which allows the theory of ^-stable functions to be mapped to the ordinary 
theory of stable functions via a bi-rational change of variables whenever Q is the product of circular 
regions. This mapping acts nicely with respect to some aspects such as closure properties but not as 
nicely with respect to coefficient sequences. Thus, for example, it may shown that Schur-stability, 
where £1 = T> d and T> is the open unit disk, has an unexpected closure property: if / and g are 
multi-affine Schur-stable polynomials then the Hadamard product / • g is Schur-stable as well; 
here the Hadamard product of ^2$ a (S) zS an< ^ b(S)z s ^ s defined to be ^ s a(S)b(S)z s . Our 
principal interest is in "H-stable polynomials and their coefficients. We will not henceforth consider 
notions of stability other than upper half-plane stability. 

A number of the closure properties of the class of stable polynomials extend immediately from the 
univariate to the multivariate setting or are otherwise elementary. In particular, if / G C[z\, . . . , z^] 
is stable then so are the polynomials obtained by the following operations: 

(a) dilation: replacing / by f(b\Zi, . . . , bdZd), where {bj} are nonnegative constants; 

(b) permuting the names of the variables; 

(c) specialization: setting Zj equal to a constant in %; 
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(d) diagonalization: setting Zj equal to Zi\ 

(e) inversion: replacing / by f(—l/zi, z 2 , ■ ■ . , z d )\ 

(f) differentiation: replacing / by df /dzf, 

(g) limits: fn~^f uniformly on compact sets and /„ stable implies / is stable; 

see for example |Wagll| Lemma 2.4]; here, and after, in results such as the this one, we will take 
"stable" to include the zero polynomial in order not to have to make exceptions. 

We also recall that Proposition 11.31 equates stability of a real homogeneous polynomial to hy- 
perbolicity in all directions in the positive orthant. Although the definition of hyperbolicity is more 
complicated for non-homogeneous polynomials, the corresponding fact characterization of real stable 
polynomials is not. 

Proposition 5.1. A (non identically zero) real d-variable polynomial f is stable if and only if for 
all v G M. d and u G Mi.) the polynomial 1 1— > /(v + tu) has only real zeros. 

Proof: If / has a zero (z\, ■ ■ ■ , Zd) in H d , then writing zj = Vj + iuj, we see that u G and 
i is a zero of /(v + tu). Conversely, if u G and v G K d such that /(v + tu) has a root with 
nonvanishing imaginary part, we may conjugate if necessary to find a root t = a + bi with b > 0; 
then v + tu is a zero of / in % d . □ 

There is one geometric feature of functions of two or more complex variables, absent from the 
univariate theory, that is worth noting because it strengthens closure property (c). Let / be any 
function in d variables, vanishing at b = (b\, . . . , bd) and analytic in a neighborhood of b. Suppose 
that /(•, &2 5 • • • j bd) is not the zero function. Then there is a continuous multi-valued function <f> 
on a neighborhood U of (b 2 , ■ ■ ■ ,b d ) in C d_1 such that <j)(z) —> b\ as z — > (i>2, ■■■,bd) and such 
that /(0(z),z) = 0. This perturbation property follows from the fact that at least one multivariate 
Puiseux series solution exists for z\ as a function of Z2, • • • i^d which converges to b\. This leads to 
the following lemma. 

Lemma 5.2. Let Bj, 1 < j < d be open subsets of C. Suppose a polynomial f is nonvanishing on 
Y\j—l Bj . Then f is nonvanishing on J^J . =1 Bj except possibly in two cases: f can vanish on the 
product of boundaries Y[j=i ®Bj> an d f can vanish at a point (pi, ... , bd) with coordinates {bj : j G S} 
in the interior of Bj for some set S C {1, . . . ,d} provided that f vanishes identically for all values 
of those coordinates, that is, . . . , b' d ) — whenever b'j = bj for all j ^ S . 

In particular, taking Bj = % for all j, we see that stability implies no zeros in T-L d except for real 
zeros or degenerate cases. 

A further consequence of this lemma is that (c) can be strengthened to include setting Zj equal to 
a real constant. 
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Proof: Let / vanish at b = (b\, ... , bd) where bj 6 dBj for j < r and bj € for j > r. We need 
to show that the hypotheses are contradicted if k > r is a coordinate such that / does not vanish 
identically when the remaining coordinates are fixed. This follows if we perturb each coordinates 
bj,j 7^ k so as to lie in the open region Bj, by sufficiently small amounts so that the perturbation 
solution for Zk lies in B^. □ 

The above facts are generally quite elementary. Before we get to the fun stuff, there is some serious 
overhead in deriving further closure properties. One route to developing properties of multivariate 
stable functions is via the notion of proper position. This is in some sense a multivariate version 
of interlacing of roots. The development here specifically avoids this because I do not find it intuitive 
and because we can do everything we need without it. For a development incorporating the notion 
of proper position, see |Wagll| . 

5 . 1 Equivalences 

Recall from Proposition 11.31 the relation between stability and hyperbolicity: a real homogeneous 
polynomial is stable if and only if it has a cone of hyperbolicity containing the (strictly) positive 
orthant. A number of equivalent formulations of stability will prove useful. If / G C[z l7 . . . , 
is any polynomial, let m denote the maximum total degree of any monomial in / and define 
the homogenization fu of / to be the unique degree-m homogeneous polynomial in the variables 
Z\, . . . , z d +i such that . . . , Zd, 1) = f(zi, . . . , z d ). We may write explicitly ]h{z\, z d +\) = 

z™+i f(zi/zd+x, ■ • ■ , Zd/ Zd+i). The following proposition is proved in |BBL09| Section 4]. 

Proposition 5.3 (homogenization). The real polynomial f £ M[zi, . . . , Zd] is stable if and only if 
the homogeneous polynomial fu is hyperbolic in all directions v such that Vj > for 1 < j < d and 
v d +i = 0. 

If f also has nonnegative coefficients then the four properties are all equivalent: 

(i) f is stable; 
(ii) fu is stable; 

(Hi) fn is hyperbolic with respect to some vector in the nonnegative orthant; 
(iv) fu is hyperbolic with respect to every vector in the positive orthant. 

Proof: For one direction, assume fjj to be hyperbolic in every direction y with yj positive when 
j < d and yd+i = 0. For any x € K d and y € Rl, the polynomial 

*->/(x + ty) = .M(x,l)+t(y,0)) 
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is not identically zero because lim t _ i . 00 t~ m /(x + ty) is equal to /ff(y, 0) 7^ 0. It has only real zeros, 
by hyperbolicity of Jh in direction (y,0), hence i is not a root, hence /(x + iy) 7^ 0; but x £ R d 
and y S R+ are arbitrary, so this is stability of /. 

For the other direction, suppose Jh fads to be hyperbolic in a direction v for which Vj > 0, 1 < 
j < d, and Vd+i — 0. Then (x + tv) has a non-real zero t = u for some x g R d+1 . The complex 
conjugate of u is also a zero so we may assume without loss of generality that the imaginary part 
of u is positive. The vector y := x + uv lies in H x R. There are three cases. First, if yd+i > 
then dividing though by this gives a zero of / in T-L d , implying / is not stable. Second, if yd+i < 0, 
dividing yields a zero in — T~L d and conjugating, we see again that / is not stable. Finally, if yd+i = 0, 
we may use perturbation to reduce to the case where yd+i =/= 0. 

For the equivalence of the four properties in the case of nonnegative coefficients, recall that the 
cone of hyperbolicity of fn containing some vector v is the connected component in R d+1 that 
contains v of the set where fn is nonzero. By nonnegativity of the coefficients, if this cone contains 
any vector in the closed nonnegative orthant then it contains all vectors in the positive orthant. This 
establishes the equivalence of (iii) and (iv) . Equivalence between (iii) and (ii) is Proposition 11.31 
Finally, we note that (ii) implies (i) by setting Zd+i = 1 (as is allowed by the last consequence of 
Lemma [5T2]) . while (i) implies (iii) by the first part of this proposition. □ 

Remark. The perturbation argument also proves that stability of / implies stability of fihom(/), 
where fihom(/)(zi, ,,.,Zd) '■= Jh{zi, . . . ,z<j,0) is the leading homogeneous part of /; see, e.g., [COSWOil 
Proposition 2.2]. 



The special case of multi-afRne polynomials 



Suppose / is multi-affine, that is, no variable appears in any monomial with power two or higher. 
The following equivalence is taken by some to be the definition of stability in the multi-affine case. 
Its significance will become evident in the next section. 

Theorem 5.4 ( Bra07, Theorem 5.6]). Let f be real and multi-affine. Then f is stable if and only 
if the inequality 

S< x »^w >- ^iSL, {x> (5 - i) 

holds for all real x and distinct i,j < n. 

The proof given here is considerably simpler than the published proof, which is a "proper posi- 
tion" argument. It comes from the same source (P. Branden, personal communication) and begins 
with the following lemma. 

Lemma 5.5. Let Q,Re C[zi, . . . , z n ] and define P = Q + z n+ iR € C[zi, . . . , 2 n +l]- Let il C C™ be 
any connected set and let D\, D2 be closed subsets ofC with disjoint interiors, such that D1UD2 = C 
and J := D\C\ D2 a simple curve separating D\ and D%. 
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If P has no roots in fi x J and R has no roots in then P either has no roots in x D\ or has 
no roots in ft x Di. 



Proof: P having no roots in 17 x J is equivalent to — Q(z)/i?(z) ^ J for all z £ Q. Hence —Q/R 
maps f2 either to the interior of D\ or the interior of D2, hence to or D\, and the result follows. 
□ 



Proof of theorem: For the forward implication, suppose / is stable. Let ej denote the i 
standard basis vector and define G(s,t) := /(x + tei + sej). For any x £ M. d , the function G is 
stable in the variables (s, t). To see this, note that this is equivalent to / having no zeros in % with 



coordinates i and j in H] this is ruled out by Lemma 15.21 By multi-affinity, we may express G as 
a + bs + ct + dst where a, 6, c, e? arc given by partial derivatives: 

d f df d 2 f 

G(s, t) = /(x) + -L( X )s + (x)i + «-£-(x)«* . 



For bivariate multi-affine polynomials a + bs + ct + dst, stability is equivalent to ad < 6c, and the 
forward direction follows. 

For the reverse direction, use induction on n. When n = 1 the result is true because all nonzero 
real polynomials a + bx are stable. Assume the reverse result for n and suppose P = Q + z n +iR 
satisfies (|5.1j) . If Q or i? is identically zero then we are done by induction, so assume not. For any 
real x, it is clear that P(zi, . . . , z n , x) satisfies (|5.ip . hence, by induction, the polynomials Q, R and 
Q + xR are all stable or identically zero. If Q + aR = for some real a then P = (z n+ i — a)R 
which is stable. We may assume, therefore, that there is no a for which Q + aR is identically zero. 
It follows that P has no zeros in H n x R where H is the open upper half-plane. By Lemma IB~5l this 
implies that either P or P(z\, . . . , z n , — z n +i) is stable. In the latter case, by the forward direction, 
equation (|5.ip holds with the signs reversed for j = n + 1 and i < n, and hence with equality for such 
i and j. This works out to (dR/dxi)Q = (dQ / dxi)R for all i < n, that is, Q and R are multiples of 
each other, which again implies that P is stable. □ 



5.2 Operations preserving stability 

In a brilliant series of papers [BB09a, BB09b, BB10 , Borcea and Branden finished off the problem of 
extending the Polya-Schur Theorem to multivariate stability preserving maps. This work not only 
generalizes from one variable to several, but also from multiplier sequences to arbitrary C-linear 
maps. I will present here two of their results, one for multiplier sequences and one giving universal 
test functions for general linear maps, proving only those parts that will be of use later. 

As before, if A = {A r : r 6 Z^} is an array of real numbers, we denote by T\ the operator for 



MX 



which 

X r a r z r 



r *(l> r )=£' 



Also generalizing from before, we say that A is a multivariate multiplier sequence if T\ preserves 
the class of real stable polynomials. The following theorem is proved in [BB101 Theorem 1.8]. 

Theorem 5.6 (multivariate multiplier sequences). The array X is a d-variate multiplier sequence 
if and only if there are d univariate multiplier sequences X \ ■ ■ ■ , A^ such that 

A r = A« • • • A«> 

and satisfying a further sign condition: either every A r is nonnegative, or every A r is nonpositive, 
or the same holds for (— l)' r 'A. 



Proof in one direction: The easy direction, and all we will need below, is to see that the 
product of univariate nonnegative multiplier sequences is a multiplier sequence (also that nonnegative 
multiplier sequences preserve complex stability, not just real stability). Let A r = IljLi Arj where 
each sequence A^ is a univariate multiplier sequence. First assume that A^ is identically 1 for 
j > 2. Let / be a real stable polynomial of d variables. Fixing z%,...,Zd in the upper half-plane, 
the polynomial f\ := /(•, Z2, . . . , z<f) is univariate stable, and by the stability preserving assumption 
on A^, we see that T\ 1 (fi) is univariate stable. This polynomial having no zeros in the upper 
half- plane is equivalent to T\f having no zeros with z\ in the upper half- plane and the specified 
values of Za, ... , z^. Because these values were arbitrary values in the upper half-plane, this finishes 
the proof in this special case. For the general case, write T\ as the composition of operators of this 
form in each coordinate. □ 

One consequence of this is that the multi-affine part of a stable polynomial is stable. 

Corollary 5.7. Let f be a stable polynomial in d variables and let / ma denote the multi-affine part 
of f , that is, the sum of the square-free monomials in f. Then / ma is stable. 



Proof: The sequence ao = 1, di = 1, a n = for n > 2 is a multiplier sequence. By Theorem 15. 61 
the multivariate sequence defined by A r = 1 if r^ = or 1 for all j < d and A r = otherwise is a 
multiplier sequence. This multiplier sequence extracts the multi-affine part. □ 

A complete characterization of linear operators preserving stability of complex polynomials was 
proved in [BB09a. Theorem 2.3]. We will discuss here only the version of this result concerning 
operators preserving stability of multi-affine functions. We need to define what Borcea and Branden 
call the algebraic symbol Gt of a linear operator T. Their definition requires a degree bound on 
each variable. We consider here only the multi-affine case; if you compare to the original text, you 
should set the degree bound vector k to (1, . . . , 1). Let C ma [zi, ...,zj\ denote the space of complex 
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multiaffine polynomials in d variables. For a linear operator T : C ma [zi, . . . , Zd] — > C[^i, . . . , zj\ 
define its algebraic symbol to be the clement Gt G C[zi, . . . , Zd, w\, . . . , Wd] satisfying 

G T {zi,...,z d ,w 1 ,...,w d ) = Y II W 3 T II z j ■)■ ( 5 - 2 ) 

sc{i,. ..,d}j$s \jes J 

In other words, G applies T to IljLi^j + w j) by treating {wj} as constants and applying T to the 
resulting monomials in {zj}. 

Theorem 5.8 ( |BB09al Lemma 3.2]). Let T : C ma [zi, . . . , Zd] —> C[zi,...,Zd] be any C-linear 
operator. Then T preserves stability if and only one of the two following conditions hold: 

(a) There is a stable polynomial P and a linear functional a : C ma ,[zi, . . . , z d ] —> C such that 
T(f) = a(f)P; 

(b) The polynomial Gt{zi, ■ ■ ■ , Zd, Wi, . . . , w d ) is stable as a complex polynomial in 2d variables. 

Remark. This is a very powerful theorem and it is worth asking what part of it one needs to 
understand in order to understand the Borcea-Branden-Liggett theory of negatively dependent 
random variables surveyed in Section [6l The answer is that we only use one direction (sufficiency) 
and this is only to prove Proposition 16. 171 which could be proved elementarily as it concerns only 
bivariate functions. Nevertheless, because this would be messy and unenlightening, we will prove 
the sufficiency direction of Theorem 15.81 here, which we will then use to prove Proposition 16 . 1 71 The 
proof relies on a lemma proved thirty years ago by Lieb and Sokal LS81], which we will not prove 
here (the proof is half a page in [Wagll| ) . 

Lemma 5.9 ([LS81, Lemma 2.3]). Let P(z) + wQ(z) be stable in C[zi, . . . , z d , w] . If the degree of 
the variable Zj is at most 1 then the polynomial 

dQ(z) 



P(z)- 



d Zj 



is either identically zero or is stable. □ 

Proof of sufficiency: If T satisfies (a) the result is immediate, so assume T satisfies (b). Because 
w i— > — w^ 1 preserves W, we see that Gt(z, w) is stable if and only if w± • ■ ■ WdGx(z, — w _1 ) is stable; 
here — w _1 denotes (— w^ 1 , . . . , — w^ 1 ). From this, we see that if / € C[v\, . . . ,Vd] is stable and 
multi-affine, then 

(w 1 ---w d )-G T (z,-w- 1 )-f(v) = ]T T[z s ](-w) s /(v) (5.3) 

SC{l,...,n} 

is stable in C[z,w,v]. We now apply the Lieb-Sokal lemma d times: the j time we take w = wj 
and Zj — Vj. Each time, the Lieb Sokal lemma replaces Wj in (|5.3[) by —(d/dvj). The final result is 
that 

£ T[z s ]/( s >(v) (5.4) 

SC{l,...,n} 
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is stable in C[z,v]. Specializing each Vj to zero preserves stability. But this specialization yields 
T,sf (S) (°) T [ z % which is another name for T/(z). Thus stability of / implies stability of Tf, 
proving the theorem. □ 



5.3 More closure properties 

The multivariate Polya-Schur theorems of Borcea and Branden are powerful but there are yet more 
closure properties that are useful in probabilistic settings because they have specific probabilistic 
meanings. We state these now but defer the proofs to the next section. 

Let rij, 1 < j < d be the maximum degree of the variable Zj in a given polynomial / G 
C[z±, . . . , z,j\. One natural way to create a multi-affine polynomials out of / is to replace each 
occurrence of each power zj by e r (zj t i, . . . , Zj tTlj ), where e r is the r th elementary symmetric 

function. Here {zj >s : 1 < j < d, 1 < s < rij} are a collection of clone variables. Formally, define 
the polarization f p of / to be the unique polynomial symmetric separately in each set of clone 
variables Zj t \,... ,Zj^ nj such that substituting the value Zj for all the clones Zj jS yields the original 
function /. In Section [6.41 we will show that stability is closed under polarization. 

One final useful closure result is the following. The proof will be given after a probabilistic 
interpretation is given in Section [6.4.31 

Theorem 5.10 (partial symmetrization) . Let f be a d-variate complex poynomial. Fix 1 < i < j < d 
and define rf to be f with the roles of Zi and Zj swapped: 

rf(zi,...,z d ) = f(z 1 ,.. Zd) ■ 

Fix 8 £ [0, 1] and define fe-.i,j '■= (1 — 9)f + ^ r (/)- If f is stable, so is fe-,i,j- 

6 Negative dependence 

6.1 A brief history of negative dependence 

When random variables in a collection influence each other in a consistent direction, this can be 
very useful for obtaining one-sided bounds on moments and probabilities of natural events. For 
example, when all pairwise correlations are negative, the variance of the sum is bounded above by 
the sum of the variances. For this reason, the literature is sprinkled with definitions of various 
properties of positive and negative dependence, that hold in examples of interest and which imply 
useful consequences. Some of these apply to random variables taking values in R, R + ,Z or Z + . To 
keep matters simple and coherent, we will discuss only the case where the variables are binary 
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valued, that is, take values in the set {0, 1}. The joint law of d binary variables is a probability 
measure on B d := {0, l} d . 

The simplest and one of the weakest conditions is pairwise positive (respectively negative) cor- 
relation: MXiXj > (EXi)(EXj) (respectively <), which may also be written as P(Xi = Xj = 
1) > P(Xj = l)¥(Xj = 1) (respectively <). As we have just seen, this condition is strong enough 
to imply one-sided bounds on the variance of J2j Xj ■ It is not strong enough to give bounds on 
the moments. For this, one requires at least positive or negative cylinder dependence. Positive 
cylinder dependence is said to hold if 

jes jes 

for all subsets S C {1, . . . , n}. Negative cylinder dependence is the reverse inequality. Sometimes 
this is strengthened to require as well that 

jes jes 

The equation (|6.1[) immediately implies that the moments of Z ;= J^j Xj are at least what they 
would be for independent Bernoullis with the same marginals, while negative cylinder dependence 
implies the moments are at most what they would be in the independent case. Expanding the 
moment generating function <fr(A) := Ke xz in powers of Z, and then into monomials in the variables 
Xj shows that positive (respectively negative) cylinder dependence implies that for A > 0, the 
moment generating function </>(A) is at least (respectively at most) what it would be for independent 
Bernoullis of the same marginal; including the inequality for complementary cylinders (|6.2p extends 
the inequality for moment generating functions to A < 0. The bounds on moment generating 
functions transfer to concentration inequalities (see, e.g., |PS97[ Theorem3.4]). 

A yet stronger property is association. Say that the collection of random variables {X\, . . . , X d } 
is positively associated if 

E/(X l5 . . . , X d )g(X u . ..,X d )> E/(X l5 . . . , X d ){¥.g{X u ...,X d ) 

whenever f,g : B d — > K are nondecreasing functions (with respect to the coordinatewise partial 
order on B d ). Negative association cannot be defined by simply reversing the inequality because 
any function / is always nonnegatively correlated with itself. Therefore, we amend the definition 
to say that {Xi, . . . ,X d } are negatively associated if Mfg < E/ Kg whenever f,g:B d ^M. arc 
nondecreasing and there is a set S C {1, . . . , n} such that / depends only on {Xj : j £ S} while g 
depends only on {Xj : j S}. 

One of the most useful results on positive association is a result of Fortuin, Kasteleyn and 
Ginibrc FKG71j. Say that y covers z in the Boolean lattice B d if y > z in the coordinatewise partial 
order but there is no element strictly between y and z; we denote this relation by 

V ■> z, 
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If x and y both cover z and w covers x and y, we call this configuration a face of the lattice Bd- Say 
that the probability measure /i satisfies the positive lattice condition if whenever x,y,z,w form 
a face of Bd (with z and «; at the bottom and top), fi(x)fi(y) < fi(z)[i(w). It is immediate to see 
that this property for faces implies fx(x A y)(J,(x V y) > fj,(x)fi(y) for every pair x, y, hence we also call 
in (multiplicative) submodularity. What is not so obvious is the theorem now known as the FKG 
Theorem. 

Theorem 6.1 (FKG). If /i satisfies the positive lattice condition then /i is positively associated. □ 

The power of this theorem is that its hypotheses are easy to check while its conclusions are quite 
strong. In many statistical mechanical systems, ratios of probabilities over a face of Bd are easy to 
compute even if the absolute probabilities are not. The FKG theorem immediately implies positive 
association for ferro- magnetic Ising models, the random cluster model with q > 1, and a number of 
other models from physics, many of which are surveyed in [Llg85 . 

For negative dependence the situation is not as nice. The negative lattice condition (reverse 
the inequality on each face) does not imply negative association, or seemingly anything of value. 
Unlike the positive lattice condition it is not closed under the most natural of operations, namely 
integrating out one variable (probabilistically this means ignoring this variable and viewing /i as a 
measure on B n _i). As a result, it has been difficult to identify negatively dependent laws and to 
establish properties such as negative association. 

6.2 Search for a theory 

For some time now, a more satisfying theory of negative dependence has been sought. The goal was 
to find a property of laws /z on Bd which would be: 

• checkable for a handful of known or conjectured examples; 

• shown to imply consequences such as negative association; 

• closed under natural operations. 

This goal was popularized in the article [PemOOj which posed several challenges and stated many 
conjectures but contained few concrete results. Spoiler alert: for binary valued random variables, 
the property is that the probability generating function is stable; such a probability distribution is 
called strong Rayleigh; see Theorem 16.91 below. In the forthcoming lists of examples, consequences 
and closure properties, only in two cases are they not known to hold for the class of strong Rayleigh 
measures: Example 16.41 (the random cluster measure) is conjectured to be Rayleigh but known not 
to be strong Rayleigh, and the proposed closure property (jd| from Section 16.2.31 (restriction to an 
interval) is known to fail. 
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One open question remaining from PcmOO is whether the property known there as h-NLC + 
implies negative association. To define h-NLC + , let us first define an external field. Let \x be a 
probability measure on B n and let Xj > be positive real numbers, for 1 < j < d. Define a measure 
p! by 

nd ii, 

p (Xi, . . . ,X d ) = p[xi, ...,X d ) 

where Z := ^ x A x //(x) is the normalizing constant. We call p! the perturbation of p by the 
external field A. The negative lattice condition is not closed under integrating out a variable; 
we say that the hereditary negative lattice condition plus external fields holds (h-NLC + ) if the 
negative lattice condition holds for all measures obtained from p by imposing an external field and 
integrating out some of the variables. This was later shown to be equivalent to Wagner's Rayleigh 
condition Wag08| , which will discuss further in Section IfTBI We now survey the proposed examples, 
consequences and closure properties for the desired class of negatively dependent laws. 



6.2.1 Proposed examples 

Example 6.2 (conditioned Bernoullis). Let p be the law of independent Bernoulli variables {X\, . . . , Xd} 
with means pi, . . . ,pd, let k be an integer between 1 and d and let p! := (p\ Y]j—i Xj = k) be the 
conditional law of {Xj} given that they sum to k. This should be negatively dependent. Note that this 
is equal to the uniform measure on k- subsets of {1, ... ,n} under the external field Xj = pj/(l— pj). 

Example 6.3 (spanning trees). Let G — (V,E) be a finite connected simple graph and let {A e : 
e G E} be nonnegative weights. Let W be the set of subsets T C E such that the resulting (V, T) is 
connected and has no cycles. Each T £ W will have cardinality exactly \V\ — 1. Such subgraphs are 
called spanning trees of G. Let w(T) := YieeT denote the weight of T under the multiplicative 
weighting scheme determined by X. We define the weighted spanning measure /i (depending on G 
and X) by 

n(T) = W{T) 

This was proven in [Pem91] to have negative correlations and in 'FM921 to have negative association, 
and we would hope it to included under any reasonable definition of "negatively dependent" . 

Example 6.4 (random cluster measure). The random cluster model on a graph G — (V,E) with 
parameters q > and {X e : e G E} is the probability measure p, on {0, 1} E obtained by normalizing 
the weights 

w(T) = l[X^q NiT) 

e 

where T C E and N(T) is the number of connected components of the subgraph (V, T) of G. When 
q = 2 this is equivalent to the Ising model for ferromagnetism and for integers q > 2 it is equivalent 
to the Potts model, also from statistical physics (see, e.g. }Wu88IV ). When q > 1, the FKG Theorem 
immediately shows \i to be positively associated. When < q < 1, the measure is conjectured but not 
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known to be negatively associated. We would hope that the right definition of negative dependence 
would settle this conjecture. 

Example 6.5 (exclusion measures). The exclusion process is a continuous time Markov chain on 
Bd as follows. The state x 6 Bd is interpreted as a collection of particles at the sites j G {1, . . . , d} 
for which Xj — 1; the sites j for which Xj = are considered vacant. Let Xu t j\ be nonnegative real 
numbers, and independently at rate Xujy, let the values Xi and Xj swap. This is interpreted as a 
particle at one site i or j jumping to the other site. If both sites or neither site is occupied, then 
nothing happens. The exclusion process is a special case of the more general exchange process, 
where the labels of the sites are arbitrary, and in particular might be distinct rather than being drawn 
from the two element set {0, 1}. Let \i t be the law of the exclusion process at time t, starting from 
some deterministic state rj at time 0. It was conjectured that fit is always negatively associated. We 
would like the definition of negatively dependent measures to include /i. 

Example 6.6 (determinantal measures with Hermitian kernels). Let K be a dx d Hermitian matrix 
with spectrum in [0, 1]. For S C {1, . . . , d}, let w(S) denote the determinant of the submatrix K$ 
of K when restricted to rows and columns in the set S. There is a unique probability measure \i on 
subsets of {1, ... ,d} such that 

V{T:TDS} = \K S \ 

for each S . This is called the determinantal measure with kernel K . Such measures were proved 
in 'Lyo03l to be negatively associated. We would hope these measures satifsy our definition of 
negative dependence. 

6.2.2 Proposed consequences 

It was proposed in [PemOOj that the right definition of negative dependence for binary variables 
would imply the following properties, with definitions immediately to follow. 

(a) negative association 

(b) stochastically increasing levels 

(c) stochastic covering property 

(d) log-concave rank sequence 

Recall that a probability measure /iona lattice W is said to stochastically dominate a law 

u, written /i > v, if /i(A) > v{A) for every upwardly closed set A (the set A is upwardly closed if 
x G A and y > x implies y £ A). An equivalent condition is stated in terms of coupling: fj, >z v if 
and only if there is a measure Q on W x W whose first marginal is /i, whose second marginal is v, 
and which is supported on {(x,y) S W 2 : x > y}. In other words, we may simultaneously sample 



45 



from [i and v in such a way that the sample from /i is always greater than or equal to the sample 
from v. 

Given a measure /i on Bd and an event G C Bd with nonzero measure, denote by (p-\G) the 
measure ^(-)/^(- n G) obtained by conditioning on G. The conditional measure (fJ.\Xj = 1) may be 
identified with a measure on Bd-i by ignoring the j th coordinate. 

Property 6.1 (stochastically increasing levels). Say that a measure /j on Bd has stochastically 
increasing levels if (p\ Xj = k) ■< (p\ y\ = fc + 1) for each k such that f^{^2j %j — k) and 
x j = k + I) are both nonzero. 

The property of having stochastically increasing levels is not implied by negative association, as 
was hoped in [PcmOO , but is a desired consequence of the right definition of negative dependence. 

Property 6.2 (stochastic covering). Say that a measure v on a lattice W stochastically covers the 
measure [i, and denote this by if there is a coupling measure Q on W 2 with marginals v and [i, 

such that Q is supported on the set {{x,y) £ W 2 : x — y or x •> y}. In other words, v stochastically 
dominates /i but "only by one", in the sense that you can sample from /i by sampling from v and 
then flipping at most one one to a zero. 

Negative association of a measure /i implies that (n\Xj = 0) y {lA^j = 1) w hen viewed as a 
measure on Bd-i- The stochastic covering property is defined in jPPllj to hold when (/J.\Xj = 
0) > (n\Xj — 1), viewed as measures on Bd-i- This property was wrongly conjectured in PemOO] to 
follow from negative association. We would hope for it to follow from the right definition of negative 
dependence. 

Property 6.3 ((ultra) log-concave rank sequence). The rank sequence for the measure fi on Bd is 
the sequence (a^ := A*{2j = fc})o<fc<d- It was wrongly conjectured to follow from negative asso- 
ciation that the rank sequence is log-concave. The right definition of negative dependence turns out 
to imply not only log-concavity but ultra log- concavity, namely that for < k < d, equation (|4.4[) 
holds, which I repeat here for convenience. 

a l > "fc-i a k+l j- 6 3 ^ 

Q (fc-l) G + l) 

6.2.3 Proposed closure properties 

In addition to negative association, it was thought that the right negative dependence property 
would imply the following closure properties. 

(a) closure under products, projections (integrating out a variable), and limits; 
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(b) closure under external fields; 

(c) conditioning on the event Xj = k}'-> 

(d) more generally, conditioning on the event k\ < Y]- Xj < ki ; 

(e) total symmetrization: replacing /i(x) by (1/nl) X^es d / i ( 7rx ) w here it acts by permuting the 
coordinates; 

(f) more generally, evolution via the symmetric exclusion process for any finite time. 

It turns out that (jcj) is too strong, but that the class of strong Rayleigh measures is indeed closed 
under rank rescaling, (see the upcoming definition) when the rank sequence b is ultra log-concave. 

Definition 6.7 (rank rescaling). Let /i be a measure on Bd and let b := (60, . . . ,bd) be a vector of 
nonnegative real numbers. For x = (xx, . . . ,Xd) € Bd let N(x) := ^2j=i x j denote the sum of the 
coordinates. For nontriviality assume that Z := X)j=o ki£*{-^ = i} > 0. The rank rescaling of fi by 
h is the measure /Xb defined by 

Mb(x) = — — fi(x). 

6.3 The grail is found: application of stability theory to joint laws of 
binary random variables 

It is now acknowledged that the "correct" definition of negative dependence for binary random 
variables is the following definition due to [BBL09 . 

Definition 6.8 (strong Rayleigh). Say that the probability measure fi on Bd is strongly Rayleigh 
(also "strong Rayleigh") if its generating polynomial f G C[z\, . . . , defined by 

f(z) := £ 

weB d 

is a real stable polynomial. 

As a generating polynomial, / is automatically multi-affine, with real nonnegative coefficients. 
Recalling Theorem l5.41 we see that strong Rayleigh is equivalent to the inequality (|5.I I) on the mixed 
partial derivatives of / holding for all real x. This is sometimes taken as the definition of strong 
Rayleigh. 

The origin of the name comes from Wagner's Rayleigh property. This is the mixed partial 
inequality (|5.I [) but only required to hold for nonnegative real arguments. As was mentioned before, 
this is equivalent to h-NLC + , which is not hard to believe because varying x over corresponds 
to varying the external field arbitrarily. Strangely, when this is required to hold for negative values 
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of the external field as well, the property becomes more natural and robust. Because the Rayleigh 
property is equivalent to h-NLC + and is obviously a consequence of the strong Rayleigh property, 
we see immediately that strong Rayleigh implies pairwise negative correlations. This is, of course, 
just the tip of the iceberg. 

Theorem 6.9 (properties of strong Rayleigh measures). 

(i) Strong Rayleigh measures are negatively associated. 

(ii) Strong Rayleigh measures have stochastically increasing levels, satisfy the stochastic covering 
vrovertv \ 6.2\ and have ultra log-concave rank sequences as in (|6.3[) . The class of strong Rayleigh 
measures is also closed under rank rescaling by sequences (bo, ■ ■ ■ , bd) when the nonzero values bi are 
the coefficient sequence of a stable univariate polynomial X}j=n ^i z% ■ In particular, this implies closure 
of the class of strong Rayleigh measures under conditioning on {X^j -^j = ^} or — — 

(Hi) The class of strong Rayleigh measures is closed under products, projections, external fields, 
total symmetrization and the symmetric exclusion dynamics. 

As an example of the utility of this theorem, here is one of the two applications that brought the 
theory to my attention. 

Example 6.10 (concentration inequalities for Lipschitz functions). Let \i be a measure on Bd and 
let f : Bd — > K be Lipschitz with constant 1. When /i is a product measure, the martingale Wk '■— 
E(/ | cji, . . . has bounded increments \Wk ~ Wk—i | < 1- It follows from Azuma's inequality (see, 
e.g, ]AS08^ ) that f satisfies the Gaussian concentration inequality 

P(/ - Ef > a) < e - a2/(2n) . (6.4) 

Ln fact independence is not needed. The stochastic covering property was defined in \PP1 If precisely 
to imply \Wk — Wk-i\ <1. It follows that (|6.4I) holds for any strong Rayleigh measure. 

6.4 Proof of Theorem 16.91 

6.4.1 Proof of part (i) : negative association 

"Strong Rayleigh implies negative association" was the result of Borcea-Branden-Liggett that 
pointed the way to application of stable polynomial theory to the probabilistic setting. We follow 
their proof, which begins by paving the way for a reduction to the case of homogeneous measures. 

Lemma 6.11 (homogenization) . Let {Xi, ...,Xd} be a collection of random variables whose law 
is strong Rayleigh and define Xd+i '■= d — Y)j—i Xj. Then the collection {X\, . . . ,Xd+i} is strong 
Rayleigh. 
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Proof: The generating function for the collection {Xi, . . . , Xd+i} is just the homogenization of the 
generating function for {X\, . . . , Xd}. Therefore, what we need to check is that the homogenization 
of a stable generating function is stable. This is not true for all polynomials^ but for probability 
generating functions, or more generally any stable polynomial with nonnegative coefficients, this 
follows from (i) =>• (it) of Proposition 15.31 □ 

Next we recall the definition of polarization via clone variables from Section [5.3l it is the unique 
polynomial symmetric separately in each set of clone variables {zj.i, . . . , Zj tTl . } such that substituting 
in Zj for all the clones Zj, s yields the original function /. The following result does not rely on / 
having nonnegative, or even real coefficients. 

Theorem 6.12 (polarization). The complex polynomial f is stable if and only if its polarization f p 
is stable. 



To prove this we require the Grace- Walsh-Szego theorem. This result may be derived from the 
theory of stable polynomials, as shown in BB09b and streamlined in | Wag 11} Section 4]. We will be 
content here to quote this century old result; for a proof, see COSW04, Theorem 2.12] or Section 4 
of |Wagll| . 

Lemma 6.13 (GraceWalsh-Szego Theorem). Let f 6 C[zi, ■ ■ . , z#] be symmetric and multi-affine 
and letC be a convex circular region containing the points Ci, • ■ • , Cd, for instance the upper half-plane 
H. Then there exists at least one point £ G C such that /(Ci, • • • , Cd) = /(C • • • > 0- ^ 



Proof of Theorem 16.121 One direction is elementary: if f p is stable then the diagonalization 
property allows us to set z JjS = zj for all j and we deduce stability of /. For the nontrivial direction, 
suppose f p is not stable and let a :— {oj, s } be numbers in the upper half-plane such that / p (a) = 0. 
Fixing the numbers {aj jS : j > 1}, the Grace- Walsh-Szego Theorem implies the existence of a number 
ai in the upper half-plane such that /(oi, . . . , oi,a') = 0, where a' are the numbers a JiS for j > 1. 
Iterating, we arrive at numbers a,2, ■ ■ ■ , a n € % for which f(a±, . . . , a n ) = 0, showing that / is not 
stable. □ 

A measure /i on Bd is called Projected Homogeneous Rayleigh (PHR) if there is a measure 
v on B m for some m > d such that v is a homogeneous measure with the Rayleigh property and 
the projection of v to the law of the first d variables gives fi. If, furthermore, the law v is strong 
Rayleigh, we say that /i is PHSR. 

Corollary 6.14. Strong Rayleigh measures are PHSR. 



Proof: First extend the law /j, of the binary variables {X\, . . . ,Xd} by homogenization to a law 
fin on variables {Xi, . . . , Xd+i}, the last of which takes integer values. By Lemma 16.111 this is 

4 Indeed f(x) = x — 1 is a trivial example of a stable polynomial whose homogenization ///(a;, y) = x — y fails to 
be stable. 
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still in the class of strong Rayleigh measures. Next replace the integer variable Xd+i by clones 
. . • ,Xd+i,d} by polarization; the resulting measure v on Bid is strong Rayleigh by Theo- 
rem [(Tl^land its projection onto the first d coordinates is /i. □ 



Proof of negative association: To prove negative association, we use an argument due to 
Feder and Mihail |FM92j . This argument shows that homogeneous strong Rayleigh measures (thus 
by Corollary 16. 141 all strong Rayleigh measures) are negatively associated. All that is needed in the 
Feder-Mihail lemma is for this class to be closed under conditioning on {Xj — x} (which is obvious) 
and have pairwise negative correlations, which we have seen in (|5.ip . In short, negative association 
for strong Rayleigh measures is reduced to the following lemma. 

Lemma 6.15 (Feder-Mihail). Let S be a class of homogeneous measures on finite Boolean alge- 
bras (of differing sizes) which is closed uner conditioning and each of which has pairwise negative 
correlations. Then all measures in S are negatively associated. 

Assuming this lemma, the proof of negative association finishes as follows. The class of homo- 
geneous strong Rayleigh measures is closed under conditioning on the value of any variable. We 
have noted that strong Rayleigh implies negative pairwise correlations, so we conclude from the 
Feder-Mihail lemma that homogeneous strong Rayleigh measures are negatively associated. Any 
strong Rayleigh measure is a projection of a homogeneous strong Rayleigh measure, hence inherits 
the negative association property. It remains only to prove the lemma. 

Proof of Feder-Mihail Lemma: Any increasing function on Bd is a positive linear combination 
of increasing indicator functions, that is, indicator functions 1a of upwardly closed events A. It 
therefore suffices to prove negative correlation for upwardly closed events A and B depending on 
disjoint sets of variables. (Note: throughout the proof, the phrase "negatively correlated" includes 
the case of zero correlation.) 

We first prove this in the special case where A is the event {Xj = 1}. We use induction on the 
number of variables, m. When m — 2, the only nontrivial case is 1a = X\ and 1b — X2, which 
are negatively correlated by hypothesis. Now assume for induction that for all measures v € S on 
Boolean lattices Bd for d < m, for all j, and for all upwardly closed events B not depending on Xj, 
the functions Xj and 1b are negatively correlated. Let \x be a law on B m +i, and let j < m + 1 and 
B C B m +i not depending on Xj be given. We will use the inequality 

pa + (1 - p)b < p'c + (1 - p')d 

holding when p,p' , a, b, c, d € [0, 1] with p<p',a<c,b<d and a > b. To use this inequality, fix 
i < m + 1 to be determined later. Let p :— /j,(Xi = 1 \Xj = 1) and p' := /i(Xi = 1 \Xj = Q), the 
inequality p < p' holds because it is equivalent to pairwise negative correlation of Xi and Xj. We 
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let 



a 


■ = n{B\Xi 


= l,Xj 


= i); 


b 


■ = n{B\Xi 


= Q,Xj 


= i); 


c 


■= n{B\X t 


= l,X j 


= 0); 


d 


■ = v{B\X l 


= 0,X j 


= 0). 



The inequalities a < c and b < d follow because the conditional measures (/i | Xj = 1), (/i | Xi = 1) 
and (n | X l = 0) are all subject to the induction hypothesis. Finally, to ensure that a > b, we now 
choose i judiciously. Equivalent conditions for the inequality a > b are 



n(B,X i = l,X j = l) 
n(B*,Xi = l,Xj = 1) 



> 



( i{B,X i =0,X j = 1) 



H(B, X t = 1, X 3 = 1) > M (B C , X t = 1, Xj = 1) 



n{B,Xi = 0,Xj = 1) 



n(B c ,X i = 0,X j = 1) 



E(Xj | B ,Xj = 1) > E(X i |B c ,X J = l). 

Homogeneity of the measure /i implies that the sum over i of the left and right-hand sides of the 
last inequality are both equal to the deterministic value y^lf Xj. Therefore the inequality must 
hold for at least one i < m + 1 . We conclude that 



fi(B | Xj = 1) = pa + (1 - p)6 < p'c + (1 - p')d = n{B \ Xj = 0) 



Having established that single variables are negatively correlated with upwardly closed events for 
all measures in <S, we consider the general case where A and B are upwardly closed events depending 
on disjoint sets of variables. Again we use induction, assuming the result for all measures on fewer 
variables and also already possessing the result in the special case. We run the argument with 1a 
in place of Xj, thus p :— n(Xi — 1 1 A), p' := n(Xi = 1 1 A c ), and so forth. The inequality p < p' 
is a consequence of the special case. The inequalities a < c and b < d come from the induction 
hypothesis applied to the conditional measures (// 1 Xi = 1) and (n\Xi = 0) respectively. Again, 
constancy of X^i 1 -^-i forces the inequality E(A,; \B,A) > K(Xi \ B c , A) to hold for some i, finishing 
the proof. □ 



6.4.2 Proof of part (ii) : stochastic inequalities 

We begin with the stochastic covering property, which follows directly from Corollary 16.141 and the 
following lemma, first proved in [PPllj . 

Lemma 6.16 ([PPllj Proposition 2.2]). If a measure is projected homogeneous strong Rayleigh then 
it has the stochastic covering property. 



51 



Proof: Let \i be a projection of a homogeneous strong Rayleigh measure v. Without loss of 
generality, we assume that the projection is onto the last d out of m coordinates. Let vq denote 
the probability measure on B m -\ which is the ;/-law of (X±, . . . ,X m -i) conditioned on X m = 0; 
similarly, let v\ denote the probability measure on B m -\ which is the ^-law of {X\, . . . , X m -\) 
conditioned on X m = 1. Negative association of v implies that vq stochastically dominates v\. let 
B C B m be any event not depending on the m th coordinate then B, that is B = x {0, 1} for some 
-B* € B m -\\ then negative correlation of B and X m implies that i>o{B*) > v\(B*). The equivalent 
coupling formulation of stochastic domination (see the beginning of Subsection l6.2.2l) shows there is 
a random variable (X, Y) such that X has law Y has law v\ and X > Y. By homogeneity of v, 
we see in fact that X always covers Y. Projecting back to Bd, this yields a random pair (X, Y) such 
that X has law (/x | X m = 0), Y has law (/it | X m = 1), and X is always equal to or covering Y. □ 

PROOF of ultra log-concavity of the rank sequence: Next we examine the rank function. 
Let \x be a strong Rayleigh measure on Bd- The rank sequence := fi(N = k) has generating func- 
tion g(z) := Y^t=o a k zk — ffi( x T--' x )- This is a stable polynomial because it is a diagonalization of 
the stable polynomial / M , recalling property (d) at the beginning of Section [5] Newton's inequalities 
fThcorem !4.2p for univariate real stable polynomials then yield ultra log-concavity of the coefficients 
of the rank sequence. □ 

Proof of rescaling by a stable coefficient sequence: Let b , . . . , b d be a sequence of non- 
negative real numbers whose nonzero elements are an interval b ri 6 r +i, . . . , b s and are the coefficients 
of a stable polynoimal. We have seen in Example 14.151 that the sequence {bk} is a multiplier se- 
quence. It follows from Theorem 15 .61 that if /(z) ~ ^ r a r z r is a real stable function of d+ 1 variables 
then g(z) := J2r ar br d+1 z r is also stable. 

Now let \x be a strong Rayleigh measure on Bd with probability generating function By 
Lemma 16.111 the measure v whose generating function is the homogenization f u = {J^)h is also 
stable (recall that v is the law of (Xi, . . . , Xj, d — Ylj=i Xj) when (X\, . . . , Xd) has law y). With 
{bk : < fc < d} any ultra log-concave sequence as above, we have seen that g(z) := J2 T a rb rd+1 z r is 
stable. Setting Zd+i equal to 1, we see that g(z)/g(l, . . . , 1) is a real stable probability generating 
function, whence the measure described at the end of part (ii) of Theorem 16.91 is strong Rayleigh. 
□ 

PROOF of stochastically increasing levels: If the interval of support [r, s] of the ultra log- 
concave sequence b satisfies s — r = or 1, then it is automatically ultra log-concave. Taking 
s = r, we see that if the law \x of {X\, . . . , Xd) is strong Rayleigh then so is (fi\N — k) as long as 
[i(N = k) > 0; here again N :— X)j=i -^r Taking s = r + 1 we see that (/i | k < N < k + 1) is strong 
Rayleigh, provided that the event conditioned on is not null. 

Fix k such that ji(N = k) and fi(N = k + 1) are both nonzero, and let fXk '■= (ft \ N = k) and 
/ifc+i := (/x | N = k + 1). The measure \ k < N < k + 1) is strong Rayleigh. Homogenizing by 
adding the check bit Xd+i '■= k + 1 — Y^j=i x i gives a measure v on Bd+i that is also strong Rayleigh. 
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Let A C Bd be any upwardly closed set and let A* := A x {0,1}. By negative association, the 
event A* is negatively correlated with Xa+i- Thus v(A st \ Xj+i = 1) < ^(A* | A,j + i = 0), which is 
equivalent to /ifc(A) < [ik+i(A). Because A was an arbitrary upwardly closed event, this proves that 



6.4.3 Proof of part (Hi) : exclusion and other closure properties 

Most of the closure properties in part (Hi) of Theorem 16.91 are easy, following from the closure 
properties of the class of stable polynomials listed at the beginning of Section [SJ The generating 
function for the product measure f^xv is the product of functions f^-f v , whence the strong Rayleigh 
property for products of strong Rayleigh measures follows from stability of the product of stable 
polynomials. Projection corresponds to setting some variables equal to 1, whence closure under 
projections follows from specialization to real values (Lemma I5.2[) . External fields correspond to 
replacing / by c/(aizi, . . . , adZd) for positive real values of the aj, which follows from the closure of 
stable polynomials under dilations. To prove closure of the class of strong Rayleigh measures under 
total symmetrization, there are two avenues. One is to deduce this from the more difficult result 
for partial symmetrization (see below), observing that total symmetrization may be achieved by 
repeated partial symmetrization. As pointed out in [BBL091 Remark 4.5], a more direct argument 
is as follows. Let /i be a strong Rayleigh measure on Bd- The sum N :— Ylj=i has generating 
function /n(z) := f^(z, . . . , z), which is stable because it is a diagonalization of / M . The generating 
function of the total symmetrization ~p is the polarization of fjsr(z), hence is stable by Theorem l6.12l 

The key to proving closure of the class of strong Rayleigh measures under exclusion dynamics 
is to prove that a single step of partial symmetrization preserves the strong Rayleigh property. 
Let n be a probability measure on Bd, let 1 < i < j < d be indices and fix < 6 < 1. The 0-partial 
symmetrization of /i with respect to indices i and j is defined to be the measure He-,i,j whose verbal 
description is as follows. 

To sample from fie-ij, first sample from /i, then flip an independent (9-coin to decide 
whether to transpose the i and j coordinates. 

At the level of generating functions, this is Theorem 15.101 which we now recall and prove. The 
generating polynomial f^e-ij is the one described in Theorem 15. 101 by 

= (1 - 0)U + <M/ M ) 

where r operates by switching arguments i and j. The theorem states that if / is stable then so is 
(1 — 9)f + 0r(f), which when applied to / = yields closure of strong Rayleigh measures under 
partial symmetrization. We now prove the theorem. 
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Proof of Theorem 15. 101 Let /i be a strong Rayleigh measure. Assume without loss of generality 
that i = 1 and j = 2. We need to show that / Mj 6»,i,2 is stable. For every choice of complex numbers 
03, 04, . . . , ad e H, we know that ■, 03, . . . , a<j) is multi-afhne and stable and we need to show 
that fn.e,i,2{', ',03, . . . , ad) is stable. Therefore it suffices to show: 

Proposition 6.17. For all multi-affine stable functions f : C 2 — > C, the function (1 — 0)f(x,y) + 
9f[y, x) is stable. 

Remark. Because of the restriction to the multi-affine case, we know that f(x, y) = a + bx + cy + dxy, 
so it cannot be too hard to prove this! We observe that if we needed this only for real stable functions, 
it would follow immediately from the characterization that / is stable if and only if ad < be, which 
follows from the mixed partial derivative criterion (|5.ip . Indeed, the proof of Theorem l 5 . 1 01 appearing 
in |BBL09[ Section 4.4] begins with this and extends via the multivariate Obreschkoff theorem. To 
keep things self-contained, we will instead derive this from the portion of Theorem 1 5 . 81 that we have 
proved; another self-contained proof is given in [Lig09| Theorem 7]; and exhaustive description of 
this four parameter family is yet another way to finish this. 

Proof of Proposition RTTTI The operator T on C ma [a;, y] corresponding to partial symmetrization 
is defined by its action on the four basis elements l,x, y and xy: 

T(l) = 1; T(x) = (1 - 6)x + 9y; T(y) = (1 - 6)y + 6x; T{xy) = xy . 

Plugging into the definition (|5 . 2[) of Gt gives 

Gt(x, y, u, v) = xy + [(1 — 6)x + 6y]u + [(1 — 9)y + 6x]v + uv . 

We need to show this is stable. It is multi-affine and has real coefficients, so we may apply the mixed 
partial derivative test (|5.1[) . There are six unordered pairs of variables. In each case we easily verify 
that the left side of (|5.1|) minus the right side is equal to a nonnegative multiple of a square when 
< 9 < 1. For example, for the pair (x, u), we evaluate (d 2 f / dxdu) f — {d f / dx)(d f / du) to obtain 
9(y + v) 2 , while for the pair (x,y) we obtain (d 2 f/dxdy)f— (df/dx)(df/dy) = 9(1 — 9)(u — v) 2 ; 
up to the symmetries of /, these are the only two cases, therefore the proposition is proved. □ 

The final step, from partial symmetrization to exclusion dynamics, is a small one. The operators 
T$;i,j defined by [i 1— > fi0;ij generated a semigroup S of operators on the space of probability measures 
on Bd- For any set of swap rates and any t > 0, the operator mapping fi to the time t law 

of the exclusion process with swap rates A started from the measure fj, is in the closure of S. This 
is more or less self-evident; for a more formal proof, note that when only one swap rate is nonzero, 
the exclusion evolution operator is already equal to an operator Te-ij, after which one can use the 
Trotter product formula (see |BBL091 Proposition 5.1]) to write the general exclusion evolution 
operator as a limit of products of those with only one nonvanishing swap rate. This completes the 
proof of Theorem 16.91 □ 
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To finish the discussion of probabilistic applications, we discuss examples. The theorem explic- 
itly addresses exclusion measures (Example I6.5|) . Conditioned Bcrnoullis (Example I6.2[) are strong 
Rayleigh due stability of 1 — p + pz for < p < 1, closure under products, (yielding all measures 
with independent coordinates), and closure under conditioning on N = k. Random cluster measures 
with q < 1 are conjectured to be Rayleigh and known not to be strong Rayleigh and spanning trees 
are a special case of determinantal measures, so among the examples 16 . 2 1 [6U1 it remains only to show 
that determinantal measures are strong Rayleigh. We will require a result which is the analogue 
of Example 12.41 but for stability instead of hyperbolicity. The exposition is taken from [BBL09 , 
Proposition 3.2] though the result itself has been know for a long time. 

Proposition 6.18. Let A%, . . . A d be (complex) positive semi- definite n x n matrices and let B be a 
Hermitian matrix, also n x n. 

1. The polynomial 

(zi, . . . , z<j) h-> f(zi, ...,Zd):= Det (z 1 A 1 H h z d A d + B) 

is either identically zero or it is real stable. 

2. If B is also positive semi-definite then the f has all nonnegative coefficients. 

3. It follows that if Z :— diag(zi, . . . , Zd) and B is any positive semi-definite d x d matrix, then 
Det (B + Z) is a multi-affine real stable polynomial with all nonnegative coefficients, hence equal 
to cfp_ for some strong Rayleigh measure pi. 

Proof: Stability is closed under limits, so it suffices to prove the proposition in the case where 
the matrices Ai, . . . , A d are positive definite. The polynomial / is real on real inputs because the 
determinant of a complex Hermitian matrix is real. Pick a vector u £ Mr; and v £ M. d . Define a 
function z by z(t) := v + tu. The matrix P := X)jLi v j^-j * s positive definite, hence invertible with 
a positive definite square root, call it Q. We may write 

f(z{t)) = Det (P) Det (tl + QHQ*) 

where H := B + X^=i u j^-j 1& Hermitian. Thus f(z(t)) is a polynomial in t that is a constant 
multiple of the characteristic polynomial of a Hermitian matrix, hence has all real zeros. Because 
v G M d and u £ R J are arbitrary, the criterion for stability in Proposition 15.11 is satisfied, proving 
the first statement of the theorem. 

Expanding the determinant, we find that the coefficients of / are products of principal minors 
of the matrices A\, ... , A d and B. When these are all positive semi-definite, the principal minors 
are positive, proving the second statement. The last statement follows immediately from setting Ai 
to the matrix with a 1 in the (i, i)-entry and zeros elsewhere, noting that this matrix is positive 
semi-definite. □ 
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Proof that determinantal measures are strong Rayleigh: Assume first that the Hermitian 
kernel K is invertible. It is easily seen that the generating polynomial / M is given by 

fn(z u . . . , z d ) = Det (I - K + KZ) = Dct (K) Dct (K^ 1 - I + Z) , 

where Z := diag(zi, . . . ,Zd). Because K is also a contraction, K^ 1 — I is positive semi-definite. 
The constant Det (K) is positive, so it follows from Example 1 2 . 4 1 that / M is stable. The general case 
may be obtained by taking limits because positive definite kernels are dense in the space of positive 
scmi-dcfinitc kernels. □ 

7 Further applications of stability: determinants, perma- 
nent s and moments 

From the outset, determinants have been prominent in the theory of hyperbolic polynomials. Already 
in |Gar51[ Example 2] , the determinant function on the space of Hermitian matrices was given as 
an example of a hyperbolic polynomial, with the nonnegative definite matrices being a cone of 
hyperbolicity (see Example 12.41 above). Related to the determinant but more enigmatic is the 
permanent. The definition of the permanent, 

n 

Per (A) := {| fl vW > 

creS n »=1 

differs from that of the determinant only in that there is no alternating sign factor (— l)P arlt y ( CT ) in 
the summand. This makes the permanent much less tractable than the determinant. For example, 
it is =#P-hard to compute the permanent of a zero-one matrix, while determinants may be evaluated 
in polynomial time. In this section we discuss two results on permanents and one on determinants. 
The first of these is a lower bound on the permanent, conjectured by van der Waerden in 1926, 
proved independently by Egorychev and Falikman in 1981, and re-proved in a simpler and more 
general way by Gurvits in 2009 using the theory of stable functions. The second of these results, 
the Monotone Column Permanent Conjecture asserts the stability of a certain polynomial obtained 
as a permanent. It was conjectured in 1999 and proved in 2009. The third result, the so-called 
BMV conjecture, is only tangentially related to stability theory, but has garnered enough attention 
to mandate its inclusion here. It was conjectured in 1975. A proof posted recently to the arXiv is 
believed to be correct. 

7.1 The van der Waerden conjecture 

One prolific area of research in understanding the permanent has been to identify extremal cases 
for various families of matrices. Some motivation for understanding permanents comes from graph 
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theory. If A is the incidence matrix of a bipartite graph G, then Per (A) is the number of perfect 
matchings of G. This interpretation has led to an emphasis on extrema over zero-one matrices. For 
example, one might consider matrices of zeros and ones with prescribed row sums. An upper bound, 
conjectured by Mine and proved by Bregman Bre73 is as follows. 

Theorem 7.1 (Bregman's Theorem). Let A be a nonnegative n x n matrix of zeros and ones and 
let f\i • ' • ) T n denote the row sums of A. Then 

n 

Per {A) < Y[{rj\) 1/r i . 

□ 



The lower bound is of course zero because there could be a column of zeros. If we prescribe 
the column sums as well as the row sums, can we find a nontrivial lower bound? Removing the 
restriction to zero-one matrices, this question was posed by van der Waerden in 1926 for doubly 
stochastic matrices. A matrix A — (aij)i<i j<„ is said to be stochastic if it has nonnegative entries 
and all row sums Yl^ ay are equal to 1 . The terminology comes from the fact that these matrices 
are precisely the transition kernels for Markov chains. The matrix A is said to be doubly stochastic 
if all column sums ay are equal to 1 as well. One might not see at first that the permanent of 
a doubly stochastic matrix must be nonzero, but this follows from the well known fact that doubly 
stochastic matrices are positive linear combinations of permutation matrices (and such matrices have 
permanent equal to 1). 

Intuition may suggest that the minimum occurs when all entries are equal to 1/n. Indeed, van 
der Waerden conjectured in 1926 that if A is doubly stochastic then 

Per(A)>^ (7.1) 

with equality if and only if ay = 1/n for all This theorem was proved thirty years ago, 

independently by Egorychev |Ego81| and Falikman |Fal81j . Recently, Gurvits |Gur08j gave a different 
proof using stability. Not only does this represent a considerable simplification, but the result 
is general enough to imply several other well known results. Gurvits begins by identifying the 
permanent as a coefficient of a polynomial, as follows. If A = (ay) is any n x n matrix, we may 
define a homogeneous polynomial p by 

n n 

p(x) = p A (x) := IJ ai i x 3 ■ 
i=ij=i 

The permanent of A is then the (1, . . . , 1) coefficient of p. Thus, 

Per(A) = ^-,(0,...,0). (7.2) 
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When A is stochastic, each factor in the product evaluates to 1 at (1, ... , 1). Taking the derivative 

with respect to Xj and evaluating at (1, . . . , 1) gives the j column sum, hence if A is doubly 
stochastic then 

^- Pj4 (1,...,1) = 1, l<j<n. (7.3) 

Let C n be the class of homogeneous polynomials of degree n in n variables with all coefficients 
nonnegative. Following Gurvits, we extend the definition of the term "doubly stochastic" from 
matrices to C n by saying that p is doubly stochastic if (|7.3[) holds. 

To prove the van der Waerden result, we need to derive Per (A) > n\/n n from (|7.3I) . This 
will not be true for every p 6 C n satisfying (|7.3[) but Gurvits' idea was that it should hold for all 
stable p £ C„. Because stability is closed under product and each linear polynomial with positive 
coefficients is stable, we see immediately that pa is always stable. The other ingredient in Gurvits' 
proof is to strengthen the induction, replacing the lower bound of n\/n n by (n!/n n )Cap (p), where 
Cap (p) is a constant that evaluates to 1 when p = pa for a doubly stochastic matrix A. 

Definition 7.2. Let C n be the class of homogeneous polynomials of degree n in n variables with all 
coefficients nonnegative. For p £ C n , define the capacity of p by 

Cap (p) := mf — =j 

lli=l x i 

where the infimum is over nonnegative values of the variables Xi . 

Proposition 7.3 ( |Gur08( Fact 2.2]). If p is doubly stochastic then Cap (p) = 1. 

Proof: We may view p € C n as a generating function for a probability distribution p supported 
on A„ := {u 6 Z™ : \u\ = n}. Letting E denote expectation with respect to \i we see that (|7.3[) is 
equivalent to Kujj = 1 for all j. 

For x € M" , we may evaluate p(x) = log E ■ x^ 3 . Applying Jensen's inequality to the concave 
function log(x) gives 



logp(xi,...,x n ) = logEj| 



3 = 1 



> 



= E uij log Xj . 

3 

Because Ecjj = i for all j, this last quantity is just logl^ 2 ^- Thus, on the positive orthant, 
p( x ) — Ilj=i x j showing that Cap (p) > 1. Setting Xj — 1 for all j shows that Cap (p) < 1. □ 

Recalling the lower bound on /'(0) for univariate stable polynomials with nonnegative coefficients 
from Proposition ^. 31 it is not hard to envision some kind of induction. The engine of Gurvits' proof, 
used in the induction step, is the following inequality. 



58 



Lemma 7.4 ( Gur08j Theorem 4.10]). Let p 6 C n be stable and defint 

d 

q(x l , . . . ,x„_i) := -q^-VK^x-, ■ ■ ■ ,x„_i,0) 



Then 

Cap(g) >Cap(p)- (7.4) 
where d is the maximum degree of x n in p. 

Proof: Letting xi, . . . , x„_i range over positive numbers whose product is 1, we need to show that 

q(xi, . . . ,x n -i) > Cap(p)G(d) 

where d < n is the maximum degree of x n in p and G(m) := [(m— l)/m] m_1 . Fix xi, . . . , x n _i. The 
specialization property implies that the univariate polynomial R(t) := p{x\, . . . ,x n —i,t) is stable. 
By definition of capacity, Cap (p) < inf t>0 R(t)/t. The degree of R is equal to d, whence it follows 
from Proposition 14. 31 that 



□ 



q(xi,...,x„_i) =R'(0) > Cap(p)G(d). 

We may now give Gurvits' result implying the van der Waerden conjecture. 
Theorem 7.5. Let p be stable with nonnegative coefficients. Then 

fc^- s - P (0,...,0)>C,p( P )£. 
Proof: Let q n — p and in general define 

Qn — i 

%(xi,...,x,) := — p(x 1 ,...,x i ,0...,0). 

dx i+ i ■ ■ ■ dx n 

Stability is closed under differentiation and specializing to real values, hence by induction on n — i, 
each qi is stable. Also, qi € C% for each i. Applying Lemma 17.41 with qi in place of p and i in place 
of n shows that 

Cap(g i+ i) > Cap (qi)G(i) 
because i is an upper bound for the degree of Xi in g,-. Inductively, we see that 

n 

Cap(gi) >Ca P (p)TjG«. (7.5) 

i=2 

The right-hand side may be identified as 



Y[G(i) = n\/n n . (7.6) 
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To identify the left-hand side, observe that qi(x±) is homogeneous of degree 1 in x%, that is, q\ = cx\. 
ThuS 

Cap (<zi) = c = q[(0) - d n , Qx P(0, • • • , 0) ■ (7.7) 
Plugging (17.71) and (17.6[) into (|T. 5[) proves the theorem. □ 

Another result that succumbs to this method is the Schrijver- Valiant conjecture, proved in 1998. 
This concerns an integer version of doubly stochastic matrices. Let A(/c, n) denote the collection of 
n X n matrices whose entries are nonnegative integers and whose rows and columns all sum to k. 
Define 

A(fc, n) = k~ n min{Per (A) : A E A(k, n)} 

and 

9{k) = lim X(k,n) 1/n . 

n— >oo 

In 1980 it was proved [SV80 that 8(k) < G(k) and equality was conjectured. The proof by Schri- 
jver |Sch98j was difficult. This result is a corollary of Theorem 17.51 For details on this, as well as a 
separate application of Theorem 17.51 to prove a lower bound on the mixed discriminant, see (Gur08 . 



7.2 The Monotone Column Permanent Conjecture 

Say that the n x n matrix A is a monotone column matrix if its entries are real and weakly 
decreasing down each column, that is, esij > a>i+i,j for 1 < i < n — 1 and 1 < j < n. Let J n denote 
the n x n matrix of all ones. It was conjectured in [HOW99 that whenever A is a monotone column 
matrix, the univariate polynomial Per (zJ n + A) has only real roots. A proof was given there for the 
case where A is a zero-one matrix. 

Although there was strong intuition already present, the proof had to wait for the development 
of multivariate stable function theory. In 2009, Branden, Haglund, Visontai and Wagner BHVW09 
were able to prove this conjecture by proving something stronger, namely multivariate stability. 

Theorem 7.6 (Multivariate Monotone Column Permanent Theorem (MMCPT)). Let Z n be the 

diagonal matrix whose entries QT6 the n in dctermin cites zx, . . . , z n . Let A by an n x n monotone 
column matrix. Then Per (J n Z n + A) is a stable polynomial in the variables z\, . . . , z n . Specializing 
to Zj = z for all j preserves stability, hence the original conjecture follows. 

I will give only a brief sketch of the proof, which relies on two results proved by Borcea and 
Branden. One of these is the criterion, Theorem 15.81 for a linear operator on multi-affine poly- 
nomials to be stability preserving. They use the sufficiency direction, namely that stability of 
the (2n)-variable "algebraic symbol" Gr(z,w) implies that T is a stability preserving operator on 
C ma [zi, • • • , z<i]l this is the direction for which a proof was included above. The other lemma they 
require is the following result. 
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Lemma 7.7 ( BHVW09, Proposition 2.5]). Let V be a real vector space and let tj> : V n — > R be a 
multilinear form. Let ei, . . . , e n , V2, ■ ■ ■ , V n be fixed vectors in V and suppose that the polynomial 

(p(ei,v 2 + z 2 e 2 , ...,v n + 

is not identically zero in M.[z2, ■ ■ ■ , z n ]. Then the set of all v\ G V for which the polynomial 

(f)(vi + ziex,...,v n + z n e n ) 

is stable is either empty or a convex cone over the origin containing ±e±. □ 

By means of this lemma, the MMCPT may be reduced to the special case where the entries of 
A are all zero or one. I will not reproduce this reduction here, but the main idea is an induction. 
If we have proved the result in the case where k columns are real monotone and n — k are zero-one 
monotone, then applying the lemma to one of the n — k columns shows that stability is achieved 
over a convex cone containing all zero-one monotone columns, and such a cone necessarily contains 
all real monotone columns. 

After a change of variables to yj = 1 + l/#j and the introduction of new variables x%, . . . ,x n , 
stability of Per(zj + a !3 -) will follow if we show stability of Per (a^y^ + (1 — aijXi) for monotone 
zero-one matrices. Denote this last permanent as Per (5(A)). Expanding the permanent on the 
last row gives a differential recurrence relation which may be written in form fj = Tfj-i where 
fj is the permanent of the upper j x j submatrix of B(A). Here T is an operator of the form 
k + Zj Y^i=\ 9/dzj. The sufficiency criterion reduces the task to checking stability of the algebraic 
symbol, Gt, which may be accomplished by a simple computation. 

7.3 The BMV conjecture 

In 1975, Bessis, Moussa and Villani BMV75 formulated what is now know as the BMV conjecture. 

Conjecture 7.8 (BMV). Let A and B be Hermitian n x n matrices with B positive semi-definite. 
Then the function 

A H> Tr cxp (A - XB) 

is the Laplace transform of a positive measure on [0, 00); here, Tr (•) denotes the trace. 

This seems vaguely tied to a number of the themes we have been discussing, but not necessarily 
related in a direct way to the theory of stable functions. In fact, both the conclusion and the 
hypotheses will reveal more upon further scrutiny. Beginning with the obvious, we recall that the 
linear cone of positive semi-definite matrices is a cone of hyperbolicity for the determinant function 
on the space of Hermitian matrices. Thus if we take the determinant instead of the trace of the 
exponential, the resulting function A H> Dct (A — XB) is stable. 



Gl 



Recall from Edrei's Equivalence Theorem (Theorem 14. 9p that stability of the polynomial / := 
J2j = o a j z ^ is equivalent to a sequence of inequalities, which can be summed up by saying that {a^} 
is a Polya frequency sequence. Similarly, a function <f> on [0, oo) is a Laplace transform of a positive 
measure on R + if and only is a completely monotone function, meaning that the derivatives 
of 4> do not change sign on [0, oo) and alternate in sign: for each nonnegative integer n and positive 
real t, 

(-U"|>) >0. 

This result is the Bernstein- Widder Theorem |Ber28j . Evidently the conclusion is a property whose 
definition has features in common with stability, a property we know to be true for A t— > Dct (A—XB). 

In 2004, Lieb and Seiringer |LS04| found an equivalent formulation of the BMV conjecture that 
looks even more similar to stability theory. 

Theorem 7.9. The BMV conjecture is equivalent to the polynomials 

Tr (A + \B) n 

having nonnegative coefficients for all integers n > 1 and all pairs of matrices A, B that are both 
positive semi-definite. 

The exponential is gone, the coefficient on A is positive, only integer powers are involved, and 
what is needed is a countable sequence of inequalities: all coefficients in a sequence of polynomials 
must be nonnegative. Another equivalence brings the BMV conjecture directly into the realm of 
stable function theory. Let p(x,y) be any stable bivariate polynomial with nonnegative coefficients 
and nonzero constant term. Let a mn be the Taylor coefficients of (dp/dx)/p. The BMV conjecture 
is equivalent to the proposition that (— l) m+n a mn > for all m, n. 

In July, 2011, a proof of the BMV conjecture by H. Stahl Stall) was posted on arXiv. At the 
time of writing, the refereed publication has not appeared, but word on the street is that the proof 
should be correct. The proof is sufficiently complicated that understanding its specialization to the 
3x3 case is considered a good project, perhaps for a dissertation. There is a sense that there may 
be a simpler proof out there, and that the "book proof" of the BMV-Stahl Theorem is likely to 
involve stable polynomials. 
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